arXiv:1501.06805v2 [math-ph] 3 Jun 2015 


ITP-UU-15/01 


A pedagogical introduction to quantum integrability 

with a view towards theoretical high-energy physics 

Jules Lamers 

Institute for Theoretical Physics 

Center for Extreme Matter and Emergent Phenomena, Utrecht University 
Leuvenlaan 4, 3584 CE Utrecht, The Netherlands 

j .lainers@uu.nl 


Abstract 

These are lecture notes of an introduction to quantum integrability given at the Tenth 
Modave Summer School in Mathematical Physics, 2014, aimed at PhD candidates and junior 
researchers in theoretical physics. 

We introduce spin chains and discuss the coordinate Bethe ansatz (cba) for a repres¬ 
entative example: the Heisenberg xxz model. The focus lies on the structure of the CBA 
and on its main results, deferring a detailed treatment of the CBA for the general M-particle 
sector of the xxz model to an appendix. Subsequently the transfer-matrix method is dis¬ 
cussed for the six-vertex model, uncovering a relation between that model and the xxz spin 
chain. Equipped with this background the quantum inverse-scattering method (qism) and 
algebraic Bethe ansatz (aba) are treated. We emphasize the use of graphical notation for 
algebraic quantities as well as computations. 

Finally we turn to quantum integrability in the context of theoretical high-energy physics. 
We discuss factorized scattering in two-dimensional QFT, and conclude with a qualitative 
introduction to one current research topic relating quantum integrability to theoretical high- 
energy physics: the Bethe/gauge correspondence. 
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1 Introduction 

Quantum integrability is a beautiful and rich topic in mathematical physics, lying at the in¬ 
terface between condensed-matter physics, theoretical high-energy physics and mathematics. 
Usually, a (quantum) statistical model is considered ‘solved’ if the ground states, elementary 
excitations, and various thermodynamic quantities are known. Quantum-integrable models pos¬ 
sess a deep underlying structure that often allows for exact computation of such quantities. At 
the same time several of these models are quite realistic, and theoretic results may be tested with 
experiments. Inevitably, then, the theory of quantum integrability is rather technical, which 
may obscure its beauty to newcomers. These notes aim to give a pedagogical introduction to 
quantum integrability and help the reader cross that hrst potential barrier. 

Historical overview. Quantum-integrable models emerged in two different branches of phys¬ 
ics. The hrst example came from quantum mechanics: the isotropic Heisenberg ‘xxx’ spin chain 
for (ferro)magnetism. In a seminal paper from 1931, Bethe solved this model using a method 
that now goes under the name of coordinate Bethe ansatz (cba), turning the problem of hnding 
the model’s spectrum into the problem of solving certain coupled equations, called the Bethe- 
ansatz equations (bae). In the subsequent decades Bethe’s work was developed further by 
others, and in the 1960s Yang and Yang applied the CBA to the more general ‘xxz’ spin chain. 

The second source of quantum-integrable models was statistical mechanics. Here the proto¬ 
type is the six-vertex or ice-type model for two-dimensional hydrogen-bonded crystals. In the 
late 1960s Lieb and Sutherland were able to solve the six-vertex model via the transfer-matrix 
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method — famously used by Onsager to tackle the 2d square-lattice Ising model in 1944 — to¬ 
gether with the CBA as in the work of Yang and Yang. This solution uncovered several striking 
similarities between the six-vertex model and the xxz spin chain, and shed light on the reason 
why these models could be solved. 

In the late 1970s these two stories were unified by the quantum inverse-scattering 
method (qism) developed by the ‘Leningrad group’ of Faddeev et al, and others. Using ideas 
from classical integrability and soliton theory, the QiSM provides an algebraic framework for 
quantum-integrable models, in particular yielding the bae via the algebraic Bethe ansatz (aba). 

Outline. These notes are organized as follows. Sections 2, 3 and 4 contain an introduction to 
quantum integrability, roughly following the above historical account. (The four-hour Modave 
lectures on which these notes are based covered most of this material.) The quantum-mechanical 
side of the story is treated in Section 2. We introduce spin chains like the xxx and xxz models, 
present the CBA for such models, and discuss the main results for the xxz spin chain. In Section 3 
we switch to the statistical-mechanical side. We introduce the six-vertex model, treat it using 
the transfer-matrix method and CBA, and provide the results. By examining the outcome more 
closely we uncover the correspondence with the xxz model. Equipped with this background, 
the QISM is developed in Section 4. This provides the precise relation between the xxz and 
six-vertex models and, via the ABA, allows us to rederive the results of the CBA for these models 
using a single computation. 

In Section 5 we move on to QFT and theoretical high-energy physics. After providing an 
overview of the various relations that have been found with quantum integrability, and a dis¬ 
cussion of factorized scattering in qft in two dimensions, we give a qualitative introduction to 
the Bethe/gauge correspondence as a recent example of such a relation. 

There are three appendices containing further details and background. In Appendix A we 
present the Yang-Yang function. The details of the CBA are worked out for the xxz spin chain 
in Appendix B. Finally, in Appendix C the ii-matrix of the six-vertex model is found. 

Although none of the material in these notes is new, this introduction is somewhat different 
from most other introductory texts. For example, in Sections 2.2 and 2.3 we focus on the 
conceptual basis and the physics of the CBA and its results rather than on computations. Still, 
the CBA is worked out not just for the two-particle sector but, following [1, §8.4], also for 
the general case in Appendix B. Our presentation of the transfer-matrix method and QISM 
in Sections 3.2 and 4 consistently exploits a graphical notation adapted from [2]. Though 
not always the most practical way to perform computations, this diagrammatic notation is a 
convenient way to understand what is going on algebraically. 

Further references. Many important topics in quantum integrability are barely touched in 
these notes; examples include Baxter’s TQ-method, the thermodynamic limit, correlation func¬ 
tions, and quantum groups. Luckily the literature on quantum-integrable models is extensive, 
ranging from introductory texts to very technical papers. The following references, here ordered 
alphabetically, have been useful for preparing these notes: 

• The renowned book by Baxter [1] gives a very detailed account of the CBA and the TQ- 
method for several quantum-integrable models in statistical mechanics, including the six- 
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vertex model. The notation is perhaps a bit old fashioned at times. 

• Faddeev’s famous Les Houches lecture notes [3] provide a good basis for the ABA and the 
XXX model. Some familiarity with quantum integrability may be useful. 

• Gaudin’s book [4] was recently translated into English. Amongst others the xxz spin 
chain and the six-vertex model are treated using the CBA, and the thermodynamic limit 
is studied. 

• Chapters 1-3 of the book by Gomez, Ruiz-Altaba and Sierra [2] treat the CBA and ABA 
for the xxz spin chain and the six-vertex model. The underlying quantum-algebraic 
structure is pointed out, though perhaps somewhat vaguely at times, and there are nice 
diagrammatic computations. 

• Chapters 0-2 of the book by Jimbo and Miwa [5] form a neat concise introduction to 
statistical physics, the xxz spin chain and the six-vertex model. Although the ABA is not 
discussed, the QISM is essentially treated in Sections 2.4-3.3 and 3.7. 

• Karbach, Hu and Muller [6, 7] have written a nice three-part introduction to the CBA for 
the XXX model, including a discussion of the low-lying excitations in the physical spectrum 
for both the ferromagnetic and antiferromagnetic regime. 

• The well-known book by Korepin, Bogoliubov and Izergin [8] contains a lot of information 
about the QiSM and its applications to correlation functions. The discussion of the basics 
is quite condensed. 

A standard reference for classical integrability and soliton theory is the book by Babelon, 
Bernard and Talon [9]. For more about the history of quantum integrability see e.g. [10]. Ex¬ 
perimental realizations of quantum-integrable models are described in [11]. Numerical methods 
for the XXX spin chain are discussed e.g. in [6, 7] and [12, Eds. 1 and 4]. For quantum groups 
see e.g. the chatty introduction [13, §1-6] and the mathematics books [14-16]. 

Acknowledgements. I thank the organisers of the Tenth Modave Summer School in Math¬ 
ematical Physics for giving me the opportunity to share my enthusiasm for quantum integrability 
with my peers. I am grateful to the participants of the school for their interest and questions. 
In preparing the lectures and these notes I benefited from discussions with G. Arutyunov, 
R. Borsato, W. Galleas, A. Henriques, R. Klabbers and D. Schuricht. 

I gratefully acknowledge the support of the Netherlands Organization for Scientific Re¬ 
search (nwo) under the vici grant 680-47-602. This work is part of the erc Advanced Grant 
no. 246974, Supersymmetry: a window to non-perturbative physics, and of the d-itp consortium, 
a program of the NWO funded by the Dutch Ministry of Education, Culture and Science (ocw). 
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2 Bethe’s method for the xxz model 


The pioneering work of Bethe on the one-dimensional Heisenberg model for ferromagnetism 
is one of the corner stones of the theory of quantum integrability. Although nowadays many 
quantum-integrable models can be tackled in more sophisticated ways, as we will e.g. see in 
Section 4, Bethe’s method remains a concrete and physical way to introduce the basic ingredients 
and obtain the main results. 


2.1 The xxz spin chain and its symmetries 

At the dawn of the 20th century Maxwell had formulated his laws describing the connection 
between electric and magnetic forces and optics, but the microscopic mechanism behind mag¬ 
netism was not understood. The advent of quantum mechanics brought new insights, and 
Heisenberg and Dirac independently showed in 1926 that Pauli’s exclusion principle leads to an 
effective interaction between electron spins of atoms with overlapping wave functions [17]. This 
exchange interaction formed the basis for an important model for ferromagnetism published by 
Heisenberg two years later [18] (see also [19, §8]). In one spatial dimension this is an example 
of a spin chain — a special class of quantum-mechanical models that are rather simple in their 
set-up, yet lead to a wide variety of interesting physics and mathematics. 


Spin chains. Consider a one-dimensional array of L atoms, modelled by a lattice of length L 
with uniform lattice spacing that we take equal to one. We impose periodic boundary conditions, 
so that the lattice is := Z/LZ. This choice of boundary conditions is very convenient, and 
not unreasonable since one is typically interested in the physics in the thermodynamic limit 
where L —)• oo becomes macroscopically large. 

The microscopic degrees of freedom are quantum-mechanical spins, see Figure 1. Thus each 
site I £ 'Ll comes with a finite-dimensional vector space Vi and a spin operator Si = {S^, Sf, Sf) 
on Vi satisfying the su(2)-relations. The periodic boundary conditions mean that Si+l = Si. 
We are interested in the case of spin 1/2: each Vi is a copy of with basis given by spin up 
and down, Vi = C|t); © and 5“ is represented via the Pauli matrices ct“ as usual. 


1 + 




Figure 1: One-dimensional spin chain of length L with spin 1/2 and periodic boundary condi¬ 
tions. Cartoons like this, where the spin vector at each site either points up or down, should of 
course be taken with a grain of salt: really the spins may point in any direction in C^. 

^With the thermodynamic limit in mind one should not really distinguish between two spin chains that only 
differ in the numbers of lattice sites, but rather think of a spin chain as a family of systems indexed by the 
length L G N\{1}. 


5 






The Hilbert space of the spin chain is the tensor prodnct of the Vi over the lattice, 

n=i^Vi, ( 2 . 1 ) 

IG'^l 

with (orthonormal) basis consisting of tensor products of the local spin vectors |t); and ||);. 
The subscript of Si keeps track of the factor V) in (2.1) on which this local spin operator acts 
nontrivially: 

5z=1(8)---(8)1(8)§<t(8)1(8)---(8)1. (2.2) 

1 l L 

This tensor-leg notation is used throughout the literature on quantum integrability and will 
be particularly helpful in Section 4. (Note that it does not make sense to use a summation 
convention for these subscripts as there is nothing special about precisely two operators acting 
nontrivially at the same V).) While we are at it, let us also introduce the following common 
notation. For any vector space W, ‘End(lT)’ denotes the space of all linear operators W —)■ W, 
i.e. square matrices of size dim(lF). For example: 5“ G End(V;) C End(F^) for a = x,y,z. 

The relations between the local spin operators can be packaged together into a ‘global’ spin 
Lie algebra governing the entire spin chain, 

K.sfl=iMy E (2,3) 

'y=x,y,z 


where the totally antisymmetric su(2)-structure constant is fixed by = 1. The relation (2.3) 
is sometimes called ultraloeal since the spin operators at different sites commute. For compu¬ 
tations it is convenient to work with the (sl(2) = su(2)c) ladder operators := Sf ± 15^^ 
together with Sf, satisfying 

K.5±l = ±MyS±. |S+ S,-| = 2MySf , [Sf.S±l=0. (2.4) 

With respect to the basis {|t)n \i)i} of Vi these operators are given by 


5+ = h 


0 1 
0 0 


sr = h 


/o 

0\ 


1 

o) ’ 



1 0 
0 -1 


(2.5) 


The set-up so far can be summarized in more mathematical terms by saying that a spin chain 
is a Hilbert space % as in (2.1) carrying for each I € an irreducible su(2)-representation; for 
us this is the two-dimensional (defining) representation (2.2). In fact T-L also carries a ‘global’ 
5u(2)-representation, given by the total spin operator S = {S^, , S^) defined as 

S°‘ := Sf‘ G End(Ff) , a = x,y,z . (2-6) 

This representation is reducible, as we will see in (2.17). 

Exercise 2.1. To practice with this notation, compute the matrix of with respect to the 
standard basis for % for L = 2 and L = 3. 


6 


The last piece of input is a (hermitean) Hamiltonian H G End(?^) describing the exchange 
interaction between the spins. We will need the following properties from these interactions: 
they are 

i) only nearest neighbour] 

ii) homogeneous, i.e. translationally invariant; and 
hi) at least partially isotropic, i.e. [S^,H] = 0. 

Exercise 2.2. Argue that any spin-chain Hamiltonian obeying property (i) can be written as 
H = Yli What does (ii) mean for the boundary conditions when L is finite? Try to find 

the form of the most general local contributions satisfying (ii)-(iii). 


Examples. The simplest spin chain satisfying (i)-(iii) is the Heisenberg ‘xxx’ model, 


Hxxx — ~J 'y ^ Si ■ Si-i-i , (2.7) 

where the exchange coupling J sets the energy scale. Since [S, Hxxx] = 0 this model is com¬ 
pletely isotropic and the spins have no preferred direction. Accordingly the spectrum is highly 
degenerate: the states come in an su(2)-multiplet for each energy eigenvalue. When J > 0 
the lowest energy is attained when each of the local terms in (2.7) contributes maximally, so 
the spins tend to align. This is the ferromagnetic regime studied by Bethe in 1931. In con¬ 
trast, for J < 0, the spins tend to anti-align and the macroscopic magnetization vanishes. This 
antiferromagnetic regime was first analyzed by Neel in 1948. 

A more general spin chain obeying properties (i)-(iii) is the ‘xxz’ or Heisenberg-Ising model, 

Hxxz = -JY1 ^ ’ ( 2 - 8 ) 

I^Ijl 

where A G M is the anisotropy parameter. This model was introduced by Orbach in 1958 and 
thoroughly studied by Yang and Yang in the 1960s (see [20] and references therein). In terms 
of the ladder operators (2.4) the Hamiltonian (2.8) reads 

^fxxz = -? Y{Tsr+i + srsii+ 2 Asfsf+i). (2.9) 

IG'^l 


This form clearly shows that the first two terms describe the hopping of excited spins while the 
third term counts the number of (mis)aligned neighbouring spins. 

Exercise 2.3. To get more feeling for the xxz Hamiltonian consider the summand in (2.9). By 
(2.2) and (2.5), we have e.g. oc (u^ (8) l)z,/+i(l <8> ® cr^)/^;+i. Use this to 

check that with respect to the standard basis of V) ® Vi+i 


— — 


^(T^r+i + srsi, 


_L 9 


h^j 




-A 2 
2 -A 


( 2 . 10 ) 


A/ 


1,1+1 


where zeroes are suppressed. 
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Exercise 2.4- Show that for L even it suffices to take J > 0 and A G M by using (2.4) to compute 
V-^ for V := S 21 G End('H). Which value of A corresponds to the antiferromagnetic 

XXX model in this way? 

Exercise 2.5. An external magnetic field in the z-direction can be included by adding —h J2i Sf 
to the Hamiltonian, preserving properties (i)-(iii). Show that it is enough to consider h > Ohy 
calculating W i^xxz(^) W~^ with W := Hi ‘S'f G End(??) the spin-flip operator. 

There exists a further generalization, the ‘XYZ’ model, which has a different coupling con¬ 
stant for each spin-direction a. This spoils property (iii) and the model cannot be treated using 
the Bethe ansatz (but see [1, §9-10]). 

Symmetries. Our goal is to find the spectrum of the xxz model, i^xxzl'k) = ElT). This 
will be achieved in Sections 2.2 and 2.3 using the CBA, and again in Section 4.3 with a more 
slick method. As always, the symmetries come to our aid, and we can exploit properties (ii)- 
(iii) to break our problem into smaller pieces. The following symmetries are at our disposal: 
translations along the lattice, by any amount of sites, and rotations around the z-axis, generated 
by S^. Thus the symmetry group is 

G = ZxU{l)z CZx SU{2) . (2.11) 

In mathematical terms these symmetries can be used to decompose Ti into a direct sum of 
irreducible G-representations, or ‘sectors’, which are preserved by the Hamiltonian. Let us see 
what this means concretely. 

Exercise 2.6. Without reading any further, find the consequence of partial isotropy for L = 2 by 
comparing the result of Exercise 2.1 with (2.10). What are the sectors corresponding to [/(l)z? 


M-particle sectors. Eirst we exploit the partial isotropy. H and can be simultaneously 
diagonalized by (iii), so the eigenvectors of form a basis for Ti in which the Hamiltonian is 
block diagonal. Let us show that it has the following form with respect to this basis: 


/ 


H = 


\ 


( 2 . 12 ) 


V 




The first block in (2.12) is 1 x 1 and corresponds to the pseudovacuum, which we take to be 

l^>:=(8)lt);=lt---t)e'W. (2.13) 

This vector happens to be a ground state of (2.8) if JA > 0, as we will see in (2.19), but the 
point is that |n) is an eigenvector of (with spin hL/2) and killed by all : it is a highest- 
weight vector. This makes it a suitable reference point for constructing all other S^-eigenvectors. 
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For example, the second block in (2.12) is obtained by flipping any single spin, 

= (2.14) 

1 l L 

producing L vectors (so the block has size L x L) with spin L/2 — 1. Likewise the third block 
corresponds to flipping yet another spin; since = 0 and this yields ( 2 ) 

different vectors \k,l) for 1 < k < I < L. 

In general, by repeatedly applying lowering operators Sf to |11) we construct an orthonormal 
basis describing configurations with 0 < M < L flipped spins: 

i <ii < ■■■ < Im < l . ( 2 . 15 ) 

This is the coordinate basis of T-L, which is responsible for the ‘coordinate’ in ‘cba’. A nice 
aspect of this basis is that it is very physical; its elements can be depicted as in Figure 1 (for 
which M = 5). The price we pay is that we lose manifest periodicity by restricting ourselves to 
the ‘standard domain’ 1 < li <■■■< Im < L to avoid overcounting. Consequently the periodic 
boundary conditions Si^l = Si must be imposed explicitly when working with the coordinate 
basis. This will be important in Section 2.2 and Appendix B. 

From (2.4) it follows that all spin configurations (2.15) are eigenvectors of the total spin-z 
operator: 

|/i, ...,1m) = h (L/2 - M) |/i, ...,1m) . (2.16) 

Let us write T-Lm ^ Td for the M-particle sector consisting of all vectors with M spins down. 
The (weight) decomposition of our Hilbert space into these subspaces, 

L 

H = 0 , (2.17) 

M=0 

corresponds to the block-diagonal form of H in (2.12). 

Exercise 2.7. Compute the size of the Mth block in (2.12). Check that the dimensions on both 
sides of (2.17) agree. 

The upshot is that partial isotropy allows us to focus on diagonalizing the Hamiltonian in 
the M-particle sector: our new goal is to solve the eigenvalue problem 

Hxxz\'^ m) = Em\'^ m) , I'I'm) £ 77m • (2.18) 

Magnons. Next we exploit the homogeneity; let us see how far that gets us. By (ii) the 
Hamiltonian satisfies UHU~^ = H where the shift operator U G End(77) shifts all sites to the 
left, mapping each Vi to In analogy with continuous (as opposed to lattice) models 

one often writes U =: e’^. Since U is unitary its eigenvalues are of the form e^^ for some 

^In Section 4.1 we will see that defining the shift operator as acting by translations to the right would be more 
natural from the viewpoint of the QISM, cf. (4.13). However, using that convention would result in a sign in the 
exponent in (2.20), and similarly in e.g. (2.26) and (2.29) for higher M. At any rate, this choice of convention 
essentially only affects the sign of the (quasi)momentum; of course the physical results do not depend on it. 
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real momentum p. (Note that p is defined mod 27r.) Periodic boundary conditions imply that 
= 1 is the identity operator on %, leading to momentum quantization p G as expected 

for particles on a circle. 

Exercise 2.8. For the zero-particle sector %q use homogeneity to hnd the momentum of the 
pseudovacuum (2.13). Check that Ffxxz = .Fo |^) with ‘vacuum’ energy 

Eq = -h^JAL/A . (2.19) 


The one-particle sector is hxed by homogeneity as well. Indeed, any vector in T-Li can 
be expressed in terms of the coordinate basis. Translational invariance means that = 

A>p{l) |/) should be an eigenvector of U for some momentum p. Using = U~^ it follows 
that the wave functions satisfy the recursion Afp{l -|- 1) = {l\U\Afi]p) = EPA>p{l), yielding a 
plane-wave expansion: 


Ti;p) 


1 

7z 




( 2 . 20 ) 


Exercise 2.9. Use {k\l) = 6k^i to check that the = L = dim(Ffi) vectors (2.20) constitute 
an orthonormal basis for Hi. 


The basis vectors (2.20) describe excitations around the pseudovacuum |n) called magnons: 
spin waves with quantized wavelength 2Tr/p travelling along the chain. With respect to the 
magnon basis (2.20) for Hi any translationally-invariant Hamiltonian is diagonal in the one- 
particle sector. Whether or not magnons are ‘quasiparticles’ describing low-lying excitations in 
the physical spectrum depends on the model’s parameters. 

Exercise 2.10. Compute the action of (2.9) on (2.14) to check that the dispersion relation of a 
magnon in the XXZ spin chain is 


£i{p) := Ei{p) - Eq = h^J (A - cosp) . 


( 2 . 21 ) 


Notice that for the XXX case (A = 1) the magnon with vanishing momentum contributes 
zero to the energy. This is a direct consequence of the symmetries: i^xxx has full rotational 
symmetry, so its eigenstates come in su(2)-multiplets. The zero-momentum magnon is simply 
the first su(2)-descendant of the pseudovacuum: |4'i;0) oc S'"!!!) with S~ = Yli total 

lowering operator. Turning on the anisotropy lifts the degeneracy in the spectrum. 

For M > 2 we can again write any translationally-invariant vector with momentum p as 

I^m;p) = 'ilpih,-■ ■ ,lM)\h, ■ ■ ■ Jm) ^'Em (2.22) 

with respect to the coordinate basis (2.15) of Hm- This time, however, the wave functions A>p{l) 
are not completely determined by the symmetries (2.11).^^ To diagonalize the larger blocks of 
the Hamiltonian we have to be smart: this is where the Bethe ansatz comes in. 

^For example, expand |4'2;p) G 4^2 as in (2.22). Homogeneity again recursively relates wave functions for 
excited spins at equal separation di2 ■= d(h,b), where d{k,l) := min„gz \l — k + nL\ is the distance function 
(metric) on Z^. Indeed, A>p{h + l,l 2 + 1) = {li,l 2 \U\A> 2 -,p) = A>p{h,l 2 ), so A<p{li,l 2 ) = %idi 2 ) = 
^iph iiy"(cij^ 2 ) for some function th), (or equivalently A/”) depending on the lattice only through the separation 
between the two flipped spins. However, the values of this function for different di 2 are not related by the 
symmetries (2.11). 
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2.2 The coordinate Bethe ansatz 


In a ground-breaking paper from 1931, Bethe solved the ferromagnetic regime of the xxx spin 
chain [21]. This subsection introduces the coordinate Bethe ansatz (cba) for any spin chain 
obeying properties (i)-(iii) above. The focus lies on the structure and physics of the method; 
computational details can be found in Appendix B. 

The basic idea of the CBA is to parametrize the states in the M-particle sector via para¬ 
meters Pra, 1 < m < M. In the simplest case this boils down to the result (2.20), while in 
general it is a symmetrized version of the Fourier transform. The spectrum of the Hamiltonian 
follows once the values of the pm are found from a system of coupled nonlinear equations: the 
Bethe-ansatz equations (bae). Thus, in principle, the CBA provides a concrete and physical way 
of converting the problem of diagonalizing the Hamiltonian to that of solving the BAE. 

Trials for M = 2. To understand where the CBA comes from we first consider the case M = 2. 
Expand \^ 2 ]p) as in (2.22). Note that we only have to dehne the wave function ^p{h,l 2 ) 
for 1 < li < I 2 < L. Inspired by the result (2.20) for M = 1 a reasonable first guess is to include 
two parameters, pi and p 2 , and try 

= '<i>p,{h)^p,{l2) OC , h<l2, (2.23) 

where an overall normalization, not depending on the lattice sites, is suppressed. The periodic 
boundary conditions require 

^pi,P2(^2j^i + A) = Tp^_P2(/i,Z2) , 1 </i < ^2 < A . (2.24) 

where for (2.23) the left-hand side is given by 

gip2i ^i{pih+P2h) _ (2.25) 

Exercise 2.11. Check that for (2.23) the only solution to (2.24) is pi = P 2 = 0. 

To improve our guess for the wave functions we notice that (2.25) describes two excited spins, 
like (2.23), but with the positions Im of the excitations with parameters pm interchanged. Thus 
(2.25) can interpreted as the result of scattering of the two excitations from (2.23). Correcting 
our initial trial to take into account such scattering we thus try A e'(^’hi-i-p 2 h) _j_^' gi(pi^ 2 +P 2 h)_ 
Through (2.24), periodicity now relates the pm to the coefficients as e'^^^ = N jA and e'^^^ = 
Aj a!. Setting A = A! would result in the pm each taking values as for a single, free, magnon. 
To allow for interactions between the flipped spins we promote the coefficients to functions: 

= A{p^,p2) +A'(pi,P 2) , /i < ^2 • (2.26) 

This is the ansatz (hypothesis, educated guess) proposed by Bethe for the two-particle sector. 
Exercise 2.12. Check that the vector |'I' 2 ;pi,P 2 ) given by (2.26) has momentum p = pi + P 2 - 
The two-body S-matrix (which, despite its name, is just 1x1) 
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describes the scattering of the two excitations, as can be seen by rewriting (2.26) in the form 
■^pup,{li,l2) OC ei(Pih+P2b) +5 (pi,P 2) ei(Pi^2+P2h), 

The periodic boundary conditions (2.24) impose two equations: 

= S{pi,p2) , e'P2i = S{pi,p2)~^ . (2.28) 

These are the bae for the parameters pm in the two-particle sector. Physically they say that 
when either of the excited spins is moved once around the chain (in clockwise direction) it 
scatters on the other excitation. Note that the bae together imply the periodicity condi¬ 
tion = 1 and hence momentum quantization p = pi + P 2 £ for the two-particle 

sector. The dependence on the details of the spin chain are hidden in the two-body S'-matrix 
on the right-hand side of the BAE. Until a model is specified we cannot say whether the bae 
have the right amount of, or even any, solutions for the pm- We will look at the results for the 
xxz model in Section 2.3. 

Before proceeding to the M-particle sector let us quickly recap our notation for M = 2. We 
write ‘|'I' 2 )’ for an arbitrary vector in the two-particle sector, ^\^ 2 ]pY for any vector in %2 that 
is translationally invariant (with momentum p), and ‘|'I' 2 ;pi,P 2 )’ for the specific vectors (with 
parameters pi and P 2 ) given by (2.26). 

CBA for general M. Expand £ 'Hm in terms of the coordinate basis \li, - ■ ■ ,1m) as 

in (2.22). Again we associate to each excitation a parameter p^ that is to be determined. 
We abbreviate I := [li, ■ ■ ■, Im) and p := (pi, • • • ,Pm)- 

By property (i) above we only have (very) short-ranged interactions, so the M excited spins 
do not interact when they are well separated, i.e. when no two excitations are next to each 
other. Thus, for such well-separated configurations it makes sense to look for a wave function 
of the product form 'I'pj(/i) • • • 'i’pj^filu) oc exp(ip- 1): this is the generalization of (2.23) to the 
Af-particle sector. 

Next, as for M = 2, we include all scattered configurations, labelled by permutations in Sm 
describing the ordering after the scattering. A linear combination of these Ml configurations, 
with coefficients depending on p to account for interactions between the excitations, gives the 
CBA for the (unnormalized) wave function in the Af-particle sector: 

Tp(0 = ^ A^{p) e‘P--' , h<---<lM ■ (2.29) 

ttGS'm 

Here p^ is a shorthand for the (right) action of vr G Sm on p; concretely this just means that 
p^ - I = YlmPTr{m) ^m- The ansatz (2.29) is also referred to as the Bethe wave function. As a 
check we note that (2.29) reduces to the magnon (2.20) when Af = 1, while for M = 2 the sum 
in (2.29) runs over two elements, the identity and a transposition, correctly reproducing (2.26). 

Strategy. The Bethe wave functions (2.29) yield eigenstates of the Hamiltonian in the M- 
particle sector if we can solve the equations 

{l\Hxxz\"^M',p) = Em{p)^p{1) , 1 < /i < • • • < Im ^ L . (2.30) 
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The unknowns are the energy eigenvalues Em{p) and the coefficients At^{p) (up to an overall 
normalization) as functions of p, together with the actual values of the p. The strategy to 
determine these consists of three steps: 

1. Solve the equations in (2.30) for configurations I with well-separated excitations. This is 
quite easy and will yield the M-particle dispersion relation £m{p) •= Em{p) — Eq. 

2. The equations in (2.30) with at least one pair of neighbouring excitations in 1. Although 
this is more tricky, it can be done, giving the coefficients At^{p)/A e{p)■ 

3. Impose the periodic boundary conditions. This will result in one equation for each expo¬ 
nent in (2.29): the BAE, a priori M! in total, for the allowed (‘on shell’) values of p. 

Of course this is really only half of the work: the bae still have to be solved, which has 
not been done for general M and L, and one has to let L —)• oo to study the thermodynamic 
properties of the model. In addition it remains to be seen whether all eigenstates are of the 
Bethe-form, so that the CBA does really produce the full spectrum. The above strategy is carried 
out for the M-particle sector of the xxz model in Appendix B; let us turn to the results. 

2.3 Results and Bethe-ansatz equations 

For brevity we set h = J = 1. This essentially only affects the energy eigenvalues, which can be 
restored by multiplication by h?J. 


Results for M = 2. We first collect the results of the strategy in the two-particle sector. The 
computations can be found in Appendix B. 

Step 1. For the well-separated case the equations (2.30) are satisfied by the CBA (2.26) 
provided the energy is given by the dispersion relation 

e 2 (pi,P 2 ) = 2 A - cospi - cosp 2 = ei(pi) + ei(P 2 ) • (2-31) 


This is a nice result: the energy consists of two contributions, one from each of the magnons. 
Although the values of the pm remain to be determined, and will be different from the free 
(single-magnon) case to give rise to an interaction energy, this means that the excited spins 
in the two-particle sector behave like magnons. In particular, the two parameters pm can be 
interpreted as the quasimomenta of these magnons. 

Step 2. For neighbouring excitations the equations can be solved using a trick, see Ap¬ 
pendix B, yielding the two-body S-matrix (2.27): 


S{pi,P2) = 


1 - 2 Ae‘P2 + e'(Pi+P2) 
1 - 2 Ae'Pi -\-e^ipi+P2) 


(2.32) 


Two-body scattering is unitary by virtue of the property \S{pi,p 2 )\‘^ = 1 for pm G K- 
This property, sometimes called physical unitarity, suggests defining the (real-valued) function 
0 (pi,P 2 ) := — ilog<S'(pi,p 2 ) known as the two-body scattering phase. 

The result (2.32) has two important physical implications. As S{p 2 ,pi) = S{pi,p 2 )~^ the 
Bethe wave function 'I'pi,p 2 (^i) ^ 2 ) is symmetric in the two pm upon normalizing (2.26) by 
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^{Pi^P 2 ) = S{pi,p 2 )~^^‘^■ Thus \^ 2 ]Pi,P 2 ) = \^ 2 '-,P 2 -,Pi)'- the magnons obey Bose-Einstein 
statistics. Interestingly, (2.32) also satisfies the fermion-like property S{pi,pi) = —1. Therefore 
\^ 2 ',Pi,Pi) = 0; yielding a Pauli exclusion principle for the quasimomenta.However, there is 
no spin-statistics connection in 1 -|- 1 dimensions, so these two properties are compatible. 

Step 3. The remaining task is to find equations for the values of the pm- Plugging (2.32) 
into (2.28) we obtain the bae for the two-particle sector of the xxz model: 

_ 1 - 2 AeiP2+ei(pi+P2) 

® “ 1 - 2 A e^Pi -F ehPi+P2) 

^ l-2Ae^Pi+ehPi+P2) 

1 - 2 Ae^P2 +ehPi+P2) 

Taking the logarithm shows that these are just quantization conditions for the quasimomenta: 

Lpi = 27rli - 0(pi,p2) , Lp2 = 27r/2 + 0(pi,P2) , , (2.34) 

where the Im are known as the Bethe quantum numbers. Thus the quasimomenta in the two- 
particle sector are no longer the ‘bare’ quantities (valued in ^Z^) of a free theory: (2.34) shows 
that they are ‘renormalized’ as a result of interactions between the two magnons. 

Exercise 2.13. Find an expression for Q{pi,P 2 ) by multiplying the numerator and denominator 
in (2.32) by and using log = 2iarctan 

Exercise 2.14. Check (2.31) and (2.32) without consulting Appendix B. 

Two-particle spectrum for A = 1. To see whether we have succeeded in diagonalizing the 
Hamiltonian for M = 2 one has to check whether the bae admit dim(Ff 2 ) = ( 2 ) solutions giving 
rise to linearly independent states. To get some feeling for the physics we briefly discuss the 
spectrum that Bethe found for the ferromagnetic XXX model. More details, including some nice 
plots, can be found in [6] (for the antiferromagnetic case see [7]). The solutions fall into three 
classes: 

i) There are L solutions describing a superposition of two free magnons, pi = 0 and p = 

P 2 = 21 x 12 !L. These can also be understood as the su(2)-descendants of the states in the 
one-particle sector, h) oc S |Ti;p). 

ii) The remaining solutions 0 < < p 2 < 27r can be interpreted as nearly free superpositions 

of magnons whose interactions vanish for L —)• 00 . Together with the first class these are 
seattering states. 

However, there are not enough of these solutions. To find the remaining states the CBA has to 
be improved by extending the quasimomenta to eomplex values, pm G C. Unitarity of the shift 
operator U = e'^ requires the total momentum p = pi P 2 to remain real. These account for 
the third class of solutions: 

^Anticipating the Pauli exclusing principle some authors include a sign in the CBA from the start, replacing 
(2.29) by 4/p(Z) = 


— p-i©(Pl>P2) 


= e 


(2.33) 


= g-i©(P2,pi) 
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iii) Quasimomenta with Im(pi) = — Im(p 2 ) / 0 cause ^2)1 to decrease with increasing 

separation between the magnons. These solutions can be interpreted as bound states. 


Such results were confirmed by neutron-scattering experiments, also for other models solvable 
by Bethe-ansatz techniques, see e.g. [11]. 

Let us briefly comment on a source of possible confusion. The appearance of the second 
su(2)-descendent of the pseudovacuum in the M = 2 spectrum (with pi = P 2 = 0) may appear 
to conflict with the Pauli exclusion principle for the two-body S-matrix. This issue is resolved 
by noticing that in the isotropic case the two-body S-matrix, 


^ M _ 1-2 + ei(Pi+P2) _ cot - cot f + 2i 

[Pi,P 2 )\a=i - ^ _ 2 _^gi(pi+p 2 ) “ cot ^ - cot ^ - 2i 


(2.35) 


is not continuous at the origin. Indeed, along the diagonal in the quasimomentum plane we 
have S{pi,pi) = —1, cf. the Pauli exclusion principle. However, along either axis (2.35) satisfies 
>S'(pi,0)1 A=i = 'S'(0,P2 )|a=i = +1, so the 5u(2)-descendants do not vanish. This discontinuity 
hints at the fact that the xxz model with ‘generic’ anisotropy A ^ ±1 is mathematically better 
behaved than the isotropic spin chain (cf. Appendix A). From this perspective A can be seen 
as a regulator for the xxx spin chain. 

Exercise 2.15. Check that, as A —?■ 1, the result of Exercise 2.13 matches the expression for 
0 (p1)P2)|a=i obtained directly from (2.35). 


Results for general M. Working out the strategy for the general M-particle sector, see 
Appendix B, gives the following results. 

Step 1. The equations (2.30) are solved by the CBA (2.29) for well-separated excitations if 
the contribution to the energy is 

M M 

SMip) = M A - cos Pm = ^ £l{Pni) ■ (2.36) 

m=l m=l 

Thus, the dispersion relation behaves additively for general M as well: the energy splits into 
separate contributions for each pm. Let us stress once more that this does not mean that (2.36) 
is simply the sum of free-magnon contributions; the quasimomenta pm are determined by the 
BAE to account for the interaction energy. At any rate, (2.36) does justify our quasiparticle 
interpretation for all M, so that we may conclude that M counts the number of magnons. In 
particular, since the T-Lm are preserved by the Hamiltonian, the magnon-number is conserved: 
there is no magnon production or annihilation. 

Step 2. The coefficients At^{p) in the Bethe wave function (2.29) also have to satisfy (2.30) 
for one or more pairs of neighbouring excitations. For general M there are many more such 
equations than unknowns, but it turns out that they can in fact be solved. Up to an overall 
factor the are expressed in terms of the two-body 5-matrix (2.32): 

n S{pm,Pm') ■ (2.37) 

l<m<m'<M 

s.t. 7r(m)>7r(m') 


A^jp) 

Ae{p) 
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This is a very remarkable result: physically, (2.37) says that M-magnon scattering is two-body 
reducible, i.e. factors into successive two-magnon scattering processes, governed by the two-body 
5-matrix (2.32). This is a extremely useful aspect of the model; we essentially know everything 
about the many-body scattering of magnons once we understand two-magnon scattering. Mod¬ 
els exhibiting this property are rather special. Indeed, note that if the scattering were one-body 
reducible, magnons would move freely along the spin chain. The xxz model, with two-body re¬ 
ducible scattering, is just one level up in complexity, allowing one to study interesting dynamics 
in a controlled setting. 

The results that there is no magnon-production and that scattering is two-body reducible 
hint at the existence of hidden symmetries and conserved quantities that render the xxz spin 
chain quantum integrable, see Section 5.1. This is indeed the case; we will find these conserved 
quantities in Sections 3.3 and 4.1. 

There is a nice graphical way to understand the result (2.37) [22]. Depict the initial and 
final configurations of quasimomenta (magnons) in the scattering process vr G Sm as follows: 

filial. Ptt^i) P-k{2) ■ ■ ■ Ptt(m) 


initial: pi p 2 ■ ■ ■ Pm 

Now connect equal quasimomenta by arrows in such a way that there are no points where three 
or more lines meet. Typically there are several ways in which this can be done; these (must and 
do) give the same result. For every pair n < m that is switched by tt there is a crossing X of pn 
and Pm, contributing a factor of S{pn,Pm) to For example, the coefficient At^ describing 

a three-magnon scattering process is depicted in Figure 2. 

Exercise 2.16. Rewrite the result (2.37) in terms of the two-body scattering phase for the 
normalization Ae{p) = Y\m<m' 

Exercise 2.1 7. Prove the Pauli exclusion principle for the quasimomenta in the M-particle sector 
by showing that (2.29) with (2.37) vanishes whenever pm = Pn for some m ^ n. 



Figure 2: Diagrammatic representation of (the coefficient A^,- describing) three-body scattering 
where magnons 1 and 3 are interchanged. There are two ways in which this can be done, so the 
second equality expresses a consistency condition — which is trivially satisfied since the two-body 
5-matrix is 1 x 1. 


Step 3. Finally periodic boundary conditions must be imposed on the Bethe wave func¬ 
tion (2.29) with coefficients (2.37). It turns out that there are M independent bae for the 
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quasimomenta p in the M-particle sector; 


M 

S’(Pn,Pm) , l<m<M. (2.38) 

n=l 

n^m 

Like (2.28) these have a nice physical interpretation. Since pm is the quasimomentum of the 
mth magnon, e'^™^ is the phase acquired by that magnon as it is moved once around the spin 
chain (in clockwise direction). The bae (2.38) say that this phase consists of contributions due 
to scattering on the other magnons. 

To sum up, under the assumption that all states are of the Bethe form, the CBA converts 
the problem of diagonalizing the Hamiltonian in the M-particle sector to that of solving the 
M coupled equations (2.38) for p G C^. The bae are rather complicated (see also Appendix A), 
but that was to be expected: no approximations were used to obtain them; they are exact. The 
BAE can be studied numerically as well as analytically. By plugging in the resulting on-shell 
values for the pm one finally obtains the actual eigenstates [Tm) and their energies. 

Since the bae are usually hard to solve one may wonder what we have gained by all of 
this. Notice that, although the bae become more complicated as M increases, they are not 
so sensitive to the length of the spin chain, unlike the size (^) of the M-particle block of the 
Hamiltonian. This renders them useful for studying the ground states, elementary excitations 
and several thermodynamic quantities even as L —>■ oo (under certain assumptions on the nature 
of the solutions in that limit). 

Rapidities. To conclude this subsection we introduce alternative variables that are in some 
sense more natural than the quasimomenta pm- Indeed, due to the factorization of magnon 
scattering the two-body S'-matrix plays an important role in the analysis of the model. Thus it 
is convenient to switch to coordinates for which the two-body S'-matrix takes a simple form. 

For the xxx spin chain, (2.35) suggests defining rapidities as 

Am := ^ cot ^ (2.39) 

so that the two-body S-matrix simply depends on the rapidity difference (cf. Section 5.1), 

S(A„,A„)U.l = . (2,40) 

(Depending on the context, rescaled or shifted versions of these are also used in literature.) 
Exercise 2.18. Invert (2.39) and use (2.21) to check that the quasimomentum and energy con¬ 
tribution of a magnon with rapidity A are 

J>(A)lA.i = bogL!M . E,(A)|i.i = . (2,41) 
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We conclude that in terms of rapidities the bae (2.38) for the M-particle sector of the xxx 
spin chain read 


Am + i/2 


M 

n 


Am ~ An + i 


Am 1 Am An i 

' ^ n=l 

nj^m 


1 < m < M 


(2.42) 


In the regime |A| < 1 the xxz spin chain involves hyperbolic generalizations of the above 
expressions: parametrizing the anisotropy as A = cosy we have 


1 sinh(A + i7/2) 
p(A) = - log ~ , 

1 smn(A — 17 /i) 


1 


^l(^) = hA 


sin^ 7 


2 sinh(A + 17 / 2 ) sinh(A — 17 / 2 ) ’ 


and the BAE become 
'sinh(Am + 


i7/2)7 

17 / 2 ) / si 


sinh(Am - An + iy) 


sinh(Am - 17 / 2)7 sinh(Am - An - iy) 

n^m 


1 < m < M 


(2.43) 

(2.44) 

(2.45) 


Exercise 2.19. Compute p'{\m) and compare the result with ei(Am)- (We will understand where 
this relation comes from in Section 4.3.) 

Exercise 2.20. Note that (2.43)-(2.45) become trivial in the isotropic limit. Check that the 
equations for the XXX spin chain can nevertheless be recovered by rescaling the rapidities as 
A I—)• yA before taking the limit 7 —)• 0. 

3 Transfer matrices and the six-vertex model 

Before turning to the abstract algebraic but very powerful formalism to treat the xxz model 
once more in Section 4 it is insightful to switch to the world of classical statistical physics on a 
two-dimensional lattice. We focus on the six-vertex model, which will turn out to be intimately 
related to the xxz spin chain. Several concepts that we encounter along the way will play an 
important role in Section 4 too. 

3.1 The six-vertex model 

Any lattice can be turned into a statistical model by assigning some microscopic degrees of 
freedom to the lattice and specifying a rule C 1 —)■ w{l3,C) that gives for each microscopic 
configuration C a temperature-dependent weight w{t, C), where r := ksT as usual. Often these 
are Boltzmann weights, wb{t,C) := exp(—S(C')/r), and the energy E{C) of a configuration 
determines its weight. The main object in statistical physics is the partition function 

Z{t) = (3.1) 

c 


18 








governing the statistical properties of the model. 

A well-known class of examples are Ising models, where the microscopic degrees of freedom 
are discrete ‘spin’ variables £i = ±1 at the vertices of the lattice, labelled by I, and the weights 
are determined by the energy E{C) of the ‘spin’ configurations C = {e;};. These models 
describe molecules with highly anisotropic interactions in crystals. A discussion of several 
exactly solvable Ising models can be found in [1]. 

Exercise 3.1. Check that the (anti)ferromagnetic Ising model is in fact obtained from the xxz 
spin chain in the anisotropic limits A —>■ Too. 

More generally spin chains from Section 2.1 also fit within this formalism. The lattice is 
and the local degrees of freedom are the quantum-mechanical spins in the local spaces V; ~ C^. 
For a(n) (eigen)configuration C G H = 0; V/ of spins the energy E{C) is the eigenvalue of 
the Hamiltonian H, from which we hnd the corresponding Boltzmann weight. The partition 
function arises as a trace over H: 


Z(r) = tr exp(—iI(C)/r) . (3-2) 

For any model the goal is to get a grip on the typically huge sum in (3.1). Indeed, interesting 
thermodynamics, like phase transitions, is related to non-smooth behaviour of Z in 1/r. The 
weights usually depend smoothly on the temperature, so this can only occnr in the limit where 
the lattice becomes infinite. For some statistical models there are methods that, in principle, 
allow for an exact evaluation of (3.1). The six-vertex model is an example of such an exactly 
solved model, and as we will see it can be tackled with the CBA from Section 2.2. Since 
the thermodynamics will not be relevant for us the dependence on the temperature r will be 
suppressed from now on. 

Vertex models. The model that we are going to study is an example of a vertex model in 
two dimensions.Consider a finite square lattice consisting of L rows and K columns, with 
uniform lattice spacing. We impose periodic boundary conditions in both directions, yielding a 
discrete torus Z^ x Z^. The microscopic degrees of freedom are ‘spins’, as for Ising models, bnt 
this time they are not assigned to the vertices of the lattice but rather to the edges, as shown 
on the left in Figure 3. 

For a vertex model the weight of a configuration C on the entire lattice is obtained as the 
product of vertex weights w{C, v) assigned to the vertices v of the lattice: 

w{C) = w{C,v) . (3.3) 

'hGZl xZx 

For nearest-neighbour interactions the vertex weights only depend on the four ‘spins’ surround¬ 
ing the vertex. If, in addition, the model is homogeneous (translationally invariant in both 

®In contrast to the quantum-mechanical spin chains, classical statistical models on a one-dimensional lattice 
are usnally not so interesting. Instead, the intriguing systems in statistical physics live in two spatial dimensions. 
Actually, since 1 -|- 1 = 2 -|- 0, this is not very surprising: through the (time-dependent) Schrodinger equation 
spin chains are really (1 -|- l)-dimensional, whereas time does not play a role for statistical models in thermal 
equilibrium. In Section 5.1 we will see that 2d is also special in QFT. 
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Figure 3: Example of a configuration of microscopic ‘spins’ e = ±1 on the edges in a portion 
of a two-dimensional lattice. On the left the ‘spins’ are indicated by arrows, with f and —)■ for 
e = — 1 and and ^ for e = -|-1; on the right these values are represented by a dotted and thick 
line, respectively. 


directions) the vertex weights can be denoted as follows: given ‘spin’ variables a, /S, 7 , 5 G {±1} 
on the four edges surrounding v as shown in Figure 4 we write w{C,v) = There are 

sixteen vertex weights that have to be specified, corresponding to the possible configur¬ 

ations of the ‘spins’ on the surrounding edges. 


7 

/?- 

a 


S 


Figure 4: A vertex v G x Zk with ‘spin’ variables a,P,j,6 G {±1} on the surrounding 
edges. 


Main example. The six-vertex or ice-type model describes hydrogen-bonded crystals. The 
vertices of the lattice represent larger atoms, oxygen in the case of water ice, and the edges 
model hydrogen bonds. (The square lattice is a reasonable two-dimensional approximation of 
the hexagonal structure of ice crystals found in nature, depicted in Figure 5.) The ‘spin’ on 
the edge encodes at which end of each bond the proton is, say with ‘spin’ —1 corresponding 
to the right (top) of a horizontal (vertical) edge as in Figure 3. For electric neutrality each 
oxygen atom should have precisely two hydrogen atoms close by. This translates to the ice rule 
a -\- P = 'j -\- 6, which leaves us with the six ‘allowed’ vertices shown in Figure 6 . For example, 
in Figure 3 the ice rule is only satisfied for the two vertices on the right. 

In addition ‘spin’-reversal or reflection symmetry is often imposed: w{^lP5^ = 
where the bar denotes negation. This can be interpreted as the absence of an external field, so 
that there is no preferred direction for the ‘spins’. This symmetry further cuts the number of 
independent vertex weights down to three, which are denoted by a, b, c as shown in Figure 6 . 
Thinking of these as (local) Boltzmann weights with energies Ea, E^ and Ec, we must have 
a, 6 , c > 0 for physical applications. The ice model corresponds to the special case a = b = c 
where each vertex is equally likely. 

Exercise 3.2. Argue that, because of the periodic boundaries, the two vertices shown on the 
right in Figure 6 must occur in equal amounts in any configuration contribution to the partition 
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Figure 5: In ordinary, ‘type Ih’, ice the oxygens constitute a (nearly) perfect hexagonal crystal, 
where the four nearest neighbours of each oxygen form a tetrahedron centred at that oxygen. We 
have indicated the hydrogen bonds in grey. The protons near each oxygen satisfy the ice rule. 


w 



= a 




_^ J 

Figure 6: The ‘allowed’ vertex configurations, with nonzero weights for the six-vertex 

model. The dotted and thick lines denote ‘spin’ —1 and -|-1 on those edges, respectively. 


function. Conclude that one may take them to have equal vertex weights even without imposing 
‘spin’-reversal symmetry. 

Another example of a vertex model is the eight-vertex model. This is a generalization of the 
six-vertex model where each edge still has two possible ‘spin’ configurations but the ice rule 
no longer holds, thus allowing for two more vertices with vertex weight d. These are the two 
vertices in the middle of the configuration from Figure 3. We will briefly come back to the 
eight-vertex model at the end of Section 3.3. 

Graphical notation. To set up the formalism in Sections 3.2, 4.1 and 4.2 we use a graphical 
notation. It is based on the following four rules: 

i) The basic building blocks are the vertex weights drawn as in Figure 4. 

ii) Fixed ‘spins’ are depicted using dotted (e = —1) and thick {s = -|-1) lines like in Figure 6. 
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iii) There is a summation convention for internal lines: whenever two vertices are connected 
by an ordinary (i.e. not dotted or thick) line there is an implicit sum over the two possible 
values of the ‘spins’ on the connecting edge. Thus 


7 7 


7 7 


7 7 


( 5 ' := P 


5' + P 


(3.4) 


represents Yle g{±i} w(/3\) 

iv) In view of the periodic boundary conditions we also need a way to indicate that opposite 
edges of a row or column in the lattice are connected. We draw little hooks to depict this 
periodicity. For example, the partition function specified by (3.1) and (3.3) becomes 


Z{C) 





n 

r 




r 




r 





J 

u 

j 


(3.5) 


This represents a rather complicated expression involving 2KL sums as in (3.4), with the 
summands being products of KL vertex weights a, b, c from Figure 6. 


A nice exercise to get some feeling for this notation and the six-vertex model is the follow¬ 
ing [5, §2.2]. Suppose that c 3> a, 6. The ground state, with maximal w{C), in this regime only 
involves the two vertices on the right in Figure 6. There are two such states: one is shown in 
Figure 7 and the other is obtained from this by a translation by one unit in the horizontal or 
vertical direction. To compute the leading correction to the partition function we may therefore 
restrict our attention to one ground state and calculate Z/2 instead. 

Exercise 3.3. Use the graphical notation to verify that to ninth order in a,h the partition 
function is given by 

\Z = l + V + V + h^) + \V{V+ l)a%‘^+ V a^b^{a^+ h^) + -- - , (3.6) 


where V := KL and we have set c = 1 for convenience. 


3.2 The transfer-matrix method and CBA 

The six-vertex model was solved for three important special cases in 1967 by Lieb [23] and then 
in general by Sutherland [24]. Their solutions combine the transfer-matrix method, allowing 
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Figure 7: A portion of one of the two ground states of the ‘F-model’ in the ‘low-temperature’ 
regime, arising as the special case of the six-vertex model for which a,b. 


one to rewrite the partition function in a quantum-mechanical (linear-algebraic) language, with 
the CBA from Section 2.2. We use tildes to distinguish the six-vertex model’s set-up, developed 
in this subsection, from that of the xxz spin chain. 


Transfer-matrix method. The transfer-matrix method enables one to treat classical stat¬ 
istical systems as if they are quantum mechanical by rewriting the partition function as the 
trace of some operator to get something like in (3.2). The basic idea of the method is to divide 
(3.5) into pieces corresponding to the rows of the lattice. 

To define the Hilbert space over which the trace is taken we start locally, like we did for 
spin chains. Consider a vertical edge in the Ith. column of the lattice. To this edge we assign a 
two-dimensional vector space V) with basis vectors \ai) labelled by the ‘spin’ ai G {±1} on that 
edge: 


Hz = C|-),©C|+), 


C 


©c 


(3.7) 


The Hilbert space associated to a row of vertical edges is constructed as a tensor product of 
these local vector spaces: 


n-.= ®Vi= 0 C|a)= 0 c 








(3.8) 


OL 


The next step is to define the (row-to-row) transfer matrix t on Ti, which counts the con¬ 
tribution to the partition function from the vertices in one row of the lattice: 


t := ^ 


^ G End(fi) . 


1 2 L 


(3.9) 


More concretely, t transfers \a) € Ti (which we think of as a configuration below some row) to 


23 
































a linear combination of I 7 ) (which we imagine living above that row), 


t |q) 
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(3.10) 


The coefficients ( 7 I t |q;) are polynomials in a, b, c (homogeneous of degree L) that encode how 
likely each I 7 ) is for a given |q:). Taking into account all possible ‘spin’ configurations on the 
intermediate horizontal edges, each of these polynomials in principle consists of 2 ^ terms; we 
will soon see that luckily many of these terms are zero. 

Exercise 3.4- Use the graphical notation to compute the matrix of t with respect to the basis 
I-), I—h), |H—), I++) for L = 2. What do you notice about the form of this matrix? 

The use of the transfer matrix comes from the following observation. Powers of the 
transfer matrix can be depicted as 


= 


r 




r 



E) 

r 









1 2 


(3.11) 


Since the partition function (3.5) consists of K such rows, with periodic boundary conditions 
also imposed in the vertical direction, we have 

Z= ^ {a\t^\ot) = tr(t^) , (3.12) 

where the trace is taken over 'H. In this way the computation of the partition function amounts 
to the diagonalization of the transfer matrix. 

The transfer-matrix method was famously used by Onsager in 1944 to solve the Ising model 
on a two-dimensional square lattice. Although Onsager allowed for different horizontal and 
vertical interaction strengths it turned out that the model’s behaviour near the critical temper¬ 
ature is universal in the sense that it does not depend on the ratio between the horizontal and 
vertical coupling constants. This led to the idea of universality in statistical physics, and in the 
following decades more models were found exhibiting the same critical behaviour [1, §1.3]. Only 
in 1972, with Baxter’s solution of the eight-vertex model, it became clear that there are several 
different universality classes. (It is generally believed that at criticality every universality class 
contains an integrable model, which may allow for the exact calculation of the order parameters 
for that universality class.) 
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Towards the CBA. Being familiar with the work on the xxz spin chain, Lieb and Sutherland 
realized that the transfer matrix of the six-vertex model can be diagonalized using the CBA. To 
understand why this is so let us compare the settings of the two models. The first observation 
is that the two Hilbert spaces, % from (2.1) for the xxz spin chain and % from (3.8) for the 
six-vertex model, clearly have the same form. Call the edges of x "Lk vacant when they 
have ‘spin’ —1 (dotted line) and occupied for ‘spin’ -|-1 (thick line). The two Hilbert spaces are 
isomorphic via the identification of spin down and up with vacant and occupied vertical edges, 
respectively: 


9 |Z) = ^ l«) = I-+- )en. (3.13) 

1 h 1 h 

(Note the difference between the labellings on the two sides: I has just M components 1 < < 

■ ■ ■ < Im ^ L, while the corresponding a always has L components, M of which are a plus.) 

Exercise 3.5. Compute the matrix elements {k\t\l) from (3.10) for \k),\l) G Hi, distinguishing 
the cases k < I, k = I, k > 1. 

The second thing to observe is that the operators that we want to diagonalize, i^xxz 
from (2.8) and t from (3.9), are different. A quick way to see this is by comparing the paramet¬ 
ers: the xxz Hamiltonian depends on two parameters (J, A) while the transfer matrix depends 
on the values of the three vertex weights {a,b,c). Despite this difference it may of course be 
possible to diagonalize the two operators using the same Bethe-ansatz technique. 

Exercise 3.6. For another difference between F^xxz and t compare the ways in which excitations 
and occupations in (3.13) are moved by (a single application of) these operators, and compare 
the number of terms in FfxxzIO and t\l). 

Let us take a closer look at the properties of H^xz and t. In Section 2.1 we exploited the 
fact that the xxz Hamiltonian is 

i) nearest neighbour; 

ii) translationally invariant; and 
hi) partially isotropic. 

Since the six-vertex model satisfies properties (i) and (ii), so does the transfer matrix. The 
isomorphism (3.13) suggests that the six-vertex analogue of the M-particle sector Hm ^ H 
is the subspace Hm ^ H with basis vectors |q:) containing precisely M occupancies. By (iii) 
the M-particle sectors are preserved by the xxz Hamiltonian. Let us now show that the ‘M- 
occupancy sectors’ Hm are similarly left invariant by the transfer matrix: thinking of the vertical 
direction of the lattice as (periodic and discrete) time, the occupancy number M is conserved 
for the six-vertex model as a consequence of the ice rule. This can be clearly seen by drawing 
the vertices from Figure 6 as in Figure 8. Indeed, if there are M occupancies below some row 
then, due to the horizontal periodicity, there must be M occupancies above that row as well. 
This line conservation is the six-vertex analogue of the f7(l)^-rotational symmetry of the xxz 
spin chain discussed in Section 2.1. Therefore the decomposition H = ®mHm is preserved by 
the transfer matrix too, so t is block diagonal, and we can diagonalize it in these M-occupancy 
sectors separately. 
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Figure 8: The six vertex configurations with nonzero weights from Figure 6 redrawn such that 
the ice rule for a vertex can be interpreted as line conservation at that vertex. 


Exercise 3.7. Convince yourself that, by line conservation, for given cx and 7 at most two terms 
in (3.10) have nonzero statistical weight, and that there are two precisely for the diagonal matrix 
entries (7 = 0:). Compute these diagonal entries for the M-occupancy sector. (See Figure 9 for 
some examples with L = 3.) 




Figure 9: Two typical examples of the graphical computation of the matrix elements of the 
transfer matrix for L = 3 and M = 2. The top shows that (—h +| 11+ H—) = a(? and the 
bottom says that (+ H—111 + H—) = + a3b. 


The CBA. Having identified the relevant properties of the transfer matrix we are all set to 
apply the CBA for the diagonalization of the transfer matrix in T-Lm- The basic idea is similar 
to what we described for xxz spin chain in Section 2.2 so we keep it brief; the details of the 
CBA for the six-vertex model can be found in [1, §8.3-8.4], see also the end of Appendix 

We want to find iTyf) £ 'Em T H solving the eigenvalue problem t I'kjvf) = ITm)- The 
identification 7^ ~ from (3.13) allows us to use the more convenient ‘occupancy basis’ \l) 
instead of |q:). Expand |Tm) in terms of the M-occupancy basis |/) G TLm — 'Em like in (2.22). 

®In [1, §8.2] the transfer matrix is thought of as acting from above to below a given row, as opposed to our 
convention in (3.10). The actual CBA in [1, §8.3] agrees with (3.14). One can check that, as a consequence, 
Baxter’s equations and results match those in Section 3.3 upon replacing Zm . 
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(3.14) 


The CBA for the coefficients involves parameters pm £ C or equivalently Zm ■= G C, 

M 

^z{l) = , zj := , h<-- - <Im ■ 

tt^Sm m=l 

This produces eigenvectors for the transfer matrix provided we can solve the equations 

{l\ t z) = Am{z) ^z(l) , 1 < h < ■ ■ ■ < Im < L , (3.15) 

for the eigenvalues Am{z), the coefficients At^(z) and the values of the parameters z. The 
strategy is roughly as before: 

1. Focus on the wanted terms, i.e. terms proportional to z^* as in the CBA, to find Am{z). 

2. Find A.,r(z) so as to cancel certain unwanted (‘internal’) terms. 

3. Demand that remaining unwanted (‘boundary’) terms also cancel to get the bae determ¬ 
ining the allowed values of z. 

However, in accordance with Exercise 3.6, the left-hand side of (3.15) contains many more terms 
than its xxz-analogue (2.30). Correspondingly the precise formulation of the strategy is a bit 
more involved than before too, see the end of Appendix B. Let us proceed to the outcome. 

3.3 Unexpected results 

In Section 3.2 we have already found some striking similarities between the xxz spin chain 
and the six-vertex model. Now we will see that the results of the CBA uncover a much deeper 
relation between the two models. 


Results for general M. The results of the CBA for the transfer matrix are as follows. 
Step 1. The eigenvalues of the transfer matrix are 


M 


Am{z) = JI 


.v-1 


m=l 


5 (a — hz^) -I- (T z, 
a {a - bzm^) 


M 


+ <-"11 


m=l 


a{a-bzJ-)-c 
b{a- bzm^) 


(3.16) 


Not being additive, the result is clearly different from (2.36), once more showing that iLxxz 
and t really are different operators. 

Step 2. Again the coefficients A-j^ in the CBA factor into two-occupancy contributions. 
Interestingly they are very similar to what we found for the xxz spin chain: 


A^(z) 

Ae{z) 


S{_ZmiZYa') 1 

l<ni<m' <M 

s.t. 7r(m)>7r(m') 


1 — 2 A(a, b, c) z' + z z' 
1 — 2 A(a, b,c) z + z z' 


(3.17) 


The only difference with (2.37) and (2.32) is that the anisotropy parameter A of the xxz model 
is replaced by a particular combination of the six-vertex weights, 

(A 4- lA — 

A{a,b,c):= + . (3.18) 

2ab 

This striking similarity will play a crucial role in what follows. 


27 







Step 3. There are M BAE for the parameters z: 

r , \ A/f 1 1 — r 1 - 2 ^(d. 6, c) Zfyi “h ZryiZn , , , , 

zi = (-1)^-1 J] , .V ; , - — , l<m<M . (3.19) 

1-2 A(a, 5, C) Zn + ZmZn 

Up to the dependence on (3.18) instead of A these are identical to the xxz bae (2.38). 

Exercise 3.8. Use the results of Exercises 3.5 and 3.7 to check (3.16) and (3.19) for M = 0,1. 
For M = 1 recognize geometric series in z to sum many terms on the left-hand side of (3.15). 

Commuting transfer matrices. The solution (3.17) for At^{z), and therefore is 

remarkable. Firstly, the coefficients (3.17) and the bae (3.19) are precisely the same as those 
of the xxz spin chain when the function (3.18) has hxed value A(a, b,c) = A equal to the xxz 
anisotropy parameter. Indeed, from (3.19) we see that the allowed values of the parameters 
Pm = —ilogZm match those of the pm- Secondly, the eigenvectors of the transfer matrix only 
depend on the six-vertex weights through the combination (3.18). Therefore varying the values 
of a, b, c while keeping (3.18) fixed does not change the (Bethe) eigenvectors of t{a, b, c). 

Under the assumption that all 2^ eigenvectors of t and Hxxz are of the Bethe form (see Ap¬ 
pendix A), these two facts mean that the CBA simultaneously diagonalizes the xxz Hamiltonian 
and all transfer matrices with matching value of (3.18): 

[t{a,b,c), Hxxz{A)] = 0 if A(a, 6 ,c) = A, (3.20) 

[t{a,b,c), t{a ,b',c')] = 0 if A{a,b,c) = A{a',b',c) . (3.21) 

As we will soon see these two observations hold the key to understanding the integrability of 
the six-vertex model — and that of the xxz spin chain. 

To understand the consequences of (3.20)-(3.21) let us first look at the degrees of freedom 
contained in the six-vertex model’s parameters (a, b, c). 

Exercise 3.9. Check that simultaneous nonzero rescalings (a, b, c) i —> (r a,rb,r c) do not affect 
the combination (3.18) and only modify the partition function (3.5) by an overall factor. 

Motivated by this let us fix the ratio a : b : c and the value of the function (3.18). This 
leaves a single remaining degree of freedom, known as the spectral parameter, which we denote 
by u. Observe that, through the vertex weights, the transfer matrix also depends on the spectral 
parameter, t{u) = t(^a{u),b{u),c{u)). We can now recast (3.20)-(3.21) in the form 

[t{u), Efxxz] = 0 for all u , (3.22) 

[t{u),t{v)] =0 foralltt,u. (3.23) 

Therefore there is a one-parameter family of six-vertex models, with a : b : c and A fixed but 
varying u, whose transfer matrices t(u) commute with Hxxz and each other. 

Exercise 3.10. Check that the parametrization 

a = / 9 sinh(u -|- iy) , b = psinhu , c = / 9 sinh(i 7 ) = ip sin 7 (3.24) 
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does the job, with A(a, b, c) = cos 7 . Determine the values of the crossing parameter 7 corres¬ 
ponding to the regimes A<—1, —1<A<1 and A > 1 of the xxz model. (Correspondingly, 
shifted or rescaled parameters are also commonly used in the literature.) 

Z-invariant models. How should the commutator (3.23) be interpreted from the vertex- 
model viewpoint? In terms of our graphical notation it consists of two terms of the form 


t{u) t{v) 


r 




r 









(3.25) 


with a separate spectral parameter associated to each row as indicated. This can be viewed as 
a portion of a vertex model with different values of the spectral parameter — hence different 
vertex weights yielding the same value of (3.18) — for each row of horizontal edges in the 
lattice. By (3.23) the partition function Z (3.1) of such vertex models are invariant under the 
exchange of any two rows in the lattice; accordingly those models are called Z-invariant. Thus 
the six-vertex model admits inhomogeneous generalizations that can still be tackled using the 
CBA: the translational invariance in the vertical direction is broken in such a way that the model 
remains exactly solvable. 


Analyticity. Baxter realized that it is extremely useful to allow for complex vertex weights 
and let u G C. Indeed, (3.24) then gives an analytic parametrization of the six-vertex weights 
which is even entire (in fact, this is how that parametrization can be found, see [1, §9.7]). The 
real power of the transfer-matrix method lies in the fact that all functions u 1 — {k\t{u)\l) are 
entire as well, because they are polynomial in a, b, c. This highly constrains the properties of the 
xxz and six-vertex model and ultimately renders these models exactly solvable. For example, 
the BAE have a natural interpretation in this context: 

Exercise 3.11. Since the transfer matrix is entire in u, so must be its eigenvalues Az(z). However, 
the right-hand side of (3.16) seems to have a simple pole for each u such that a{u)/b{u) = 
for some 1 < m < M. Use that Res(/, z*) = g{z^)/h'{z^) when f{z) = g{z)/h{z) with h{z^) = 0 
but h'{z^) / 0 to check that the residues at these poles satisfy 


M 


Res (Am, = ^) oc ( 1 


n=l 

n^m 



n^m 


(3.26) 


and conclude that the poles of Am disappear by virtue of the bae (3.19) with Zm = b/a. 

In particular, if one would be able to obtain the eigenvalues (3.16) of t in a different way, one 
might be able to derive bae also for models for which the Bethe-ansatz techniques described 
in these lecture notes fail, such as the XYZ spin chain and eight-vertex model. This is precisely 
what Baxter’s TQ-method manages to do by constructing another one-parameter family of 
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commuting operators Q{u) G End(?^) that satisfy certain ‘TQ-relations’ determining Am- For 
more about the TQ-method we refer to [1, §9]; see [25, §4-5] and [2, 4.2] for an account in the 
algebraic language of Section 4. 

Quantum integrability. To get a better understanding of the importance of the relations 
(3.22)-(3.23) let us parametrize the vertex weights as in (3.24). Since the transfer matrix then is 
a Laurent polynomial in e“, it makes sense to take logarithmic derivatives and define operators 
Lffc via the trace identities 


Hk := 


d^ 

du^ 


logt{u) G End(?^) ~ End(?f) 

U = U:^ 


(3.27) 


for some value n* of the spectral parameter. (In Section 4.1 we will see that u* = 0 is a 
convenient choice for our parametrization.) 

The equations (3.22)-(3.23) then imply 

[Hk: Hxxz] — 0 for ^ ) (3.28) 

[77j,77fc] =0 for all . (3.29) 


Now we can see the fruits of our labour more clearly. According to (3.28) the operators Hj. are 
conserved quantities: we have found symmetries of the xxz spin chain! Moreover, by (3.29) 
these symmetry operators commute with each other (they are in involution). The presence of 
such a tower of commuting conserved charges is a very special property; it ‘proves’ that the 
model is quantum integrahle in analogy with the notion of Liouville integrability and explains 
why magnon-scattering is two-body reducible, see Section 5.1. Thus, from the spin-chain view¬ 
point, there is a one-parameter family of six-vertex models whose transfer matrices produce 
symmetries of the xxz model through the trace identities. (In more mathematical terms 
the t{u) generate an abelian subalgebra in End(77) that commutes with Lfxxz-) What about the 
six-vertex model itself? 

Consider a six-vertex model with vertex weights (ao,6o,co) and transfer matrix fo •= 
t{ao,bo,co). Setting Aq := A{ao,bo, cq), by the above argument there exists a one-parameter 
family of six-vertex models with commuting transfer matrices, like in (3.29), such that t{uo) = to 
for some uo- From the original six-vertex model’s perspective each of these transfer matrices 
generates a discrete Euclidean ‘time’ evolution with respect to which the are ‘conserved’. 
In particular it follows that 

[Hk,to] = 0 for all k , (3.30) 

so to enjoys the same symmetries (3.27) as i7xxz(Ao)! 

Exercise 3.12. Go back to Section 2.1 to find a few operators that one may expect (or hope) to 
find amongst the G End(7f). 
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Summary. The preceding discussion can be schematically summarized as follows: 



Hxxz <-symmetries-*■ to 


(3.31) 


Although we have not uncovered the exact relation between the xxz and six-vertex models 
yet, Table 1 contains a dictionary with our findings so far. Let us stress once more that the 
correspondence between the two models is not a bijection; the xxz spin chain with anisotropy A 
corresponds to a whole family of six-vertex models parametrized by u. The precise connection 
between the two sides will become clear in Section 4.1 when we compute the first few Hk 
contained in the transfer matrix. 


xxz spin chain 

(Family of) six-vertex models 

lattice Xl 

basis vector \l) £ H 
pseudovacuum H) 
excited spin at site 1 

row of vertical edges in x Zjy 

configuration \ol) £ H on a, row 

configuration-• • • —) 

occupancy at edge 1 

translational symmetry 
partial isotropy 
anisotropy A 

horizontal translational symmetry 
ice rule/line conservation 

A(a, 6, c) = (a^ + b‘^ — c^)/2ab 

quasimomentum pm 

parameter pm = —i log Zm 


Table 1: Comparison between the ingredients of the xxz spin chain and those of the corres¬ 
ponding one-parameter family of six-vertex models. 


4 The quantum inverse-scattering method 

In the previous sections we studied the CBA for the xxz spin chain and the six-vertex model. 
The details were deferred to Appendix B: solving the equations (2.30) for the M-particle sector 
of Hxxz is rather involved as all cases with neighbouring excitations must be taken into account, 
and the equations (3.15) for the M-occupancy sector of the transfer matrix are still harder to 
obtain. It would be nice if there is an easier way to diagonalize these operators and derive the 
BAE. 

Next, through the transfer-matrix method in Sections 3.2 and 3.3 we found a correspondence 
between the xxz spin chain with anisotropy A and a one-parameter family of six-vertex models 
parametrized by the spectral parameter u £ C. In particular, Hxxz and t are simultaneously 
diagonalized, and the family of commuting transfer matrices t{u) generates a tower of symmet¬ 
ries Hk for both sides via the trace identities. However, the precise relation between t{u) and 
Hxxz is not clear yet. In addition, computing the Hk directly from the transfer matrix is rather 
cumbersome; in fact the special value u* of the spectral parameter still has to be determined. 
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In this section the quantum inverse-scattering method (qism) is introduced. This algebraic 
formalism can be used to rederive the results from Sections 2.3 and 3.3 while addressing the 
above issues, and more: 

• it provides a convenient way to find an appropriate value u* and compute the Hk using 
the trace identities; 

• it has the all-important commutativity of the t{u) built in; 

• the Fock space of (Bethe) states is constructed via creation and annihilation operators, 
and the eigenvalues and bae are derived with a single computation for general M ; 

• it allows one to define several new quantum-integrable models. 

Since the xxz and six-vertex models are treated simultaneously in the QiSM we no longer need 
to distinguish between H ~ Ff, et cetera, and can safely drop the tildes used in Sections 3.2 
and 3.3 from now on. We keep h = J = 1. 


4.1 Conserved quantities from Lax operators 

We keep using the graphical notation introduced in Section 3.1, with one additional rule: 

v) To keep track of the ordering of the various operators in products we add little arrows to 
the end of lines. (Thus these arrows are not related to the ‘spins’.) 

We will see an example of this rule soon, e.g. in (4.8). 


Lax operators. From the six-vertex point of view the choice to associate a vector space to 
the rows of x Zk, but not its columns, is somewhat unnatural. To treat the horizontal and 
vertical edges on a more equal footing we introduce the vector space 

Ca:=C|-)„©C|+), = C . ©C - . (4.1) 

spanned by the two possible ‘spins’ on a horizontal edge in the lattice. (The subscript in 14 is 
not related to the vertex weight a{u).) Motivated by the spin-chain viewpoint, the ‘vertical’ Vi 
and % from (3.7)-(3.8) are then often called physical or quantum spaces, while 14 is an auxiliary 
space. It is quite convenient to think of the spectral parameter u of the transfer matrix as being 
associated to 14. 

The auxiliary space allows us to introduce ‘local’ (vertex) operators acting at a single vertex 
of the lattice: the Lax operator ,defined as 


i 

(4.2) 

i 


Lal{u) ■■= “ 


“ G End(14 © Vi) . 


^This operator is sometimes called the ‘_R-matrix’, but we follow Faddeev [3] and reserve the latter terminology 
for a closely related operator acting on Va ® Vb, see Section 4.2. 
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The subscripts in Lai{u) remind us on which vector spaces this operator acts nontrivially, in 
accordance with the tensor-leg notation from the start of Section 2.1. The labels on the top 
and right of (4.2) will be omitted in the graphical notation from now on. 

Explicitly, writing |/3,a) := |/3) (8) |a:) for the (pure) vectors in 14 <8> Vi, (4.2) means that 


Lai{u)\l3,a) = ^ 




(4.3) 


where we also indicated the dependence of the vertex weights on u G C on the right-hand side. 

We point out that one has to be a bit careful when reading off the coefficients for the 
‘outgoing’ vector I 7 , (f) G 14 ® E) in our graphical notation: unlike for the ‘incoming’ vector 
|;0, a), the order of the labels 5 and 7 is reversed in the coefficients in the middle and on the 
right-hand side of (4.3), cf. the labels in (4.2). This reversal will come back in (4.17) below. 

Exercise 4-1- Use Figure 6 to check that the matrix of the Lax operator with respect to the 
(standard) basis 






+-) = 



!++) = 



(4.4) 


of 14 <8) Vi is given by 


Lai{u) 


b{u) 

c{u) 


c{u) 

b{u) 


\ 


(4.5) 


V «(^)/ al 

Exercise 4- Argue that the ice rule for the vertex weights is equivalent to the invariance of 
the Lax operator under simultaneous [/(l)z-rotations in I 4 and V): 

[S^, + S!,Lai{n)] = 0. (4.6) 

Exercise 4-3- Check that (4.5) can be written in a basis-independent way as 

Lai{u) = +c{u)a+S^ + c{u)a-S+ + {a{u) - b{u)) a^Sf , (4.7) 

and use this to verify (4.6) directly via the su(2)-relations (2.4). 


Trace identities. Lax operators can be used as building blocks of other, more involved op¬ 
erators. In particular, the transfer matrix (3.9) can be constructed as 


t{u) = tla {LaL{u) ■■■ La2{u)Lal{u)) 


-^ G End{n) . (4.8) 

1 2 L 
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This way of writing the transfer matrix is very useful for the computation of conserved 
charges of the xxz and six-vertex models using the trace identities. 

As we will soon see, it is actually convenient to alter (3.27) slightly by setting 

log G End{n) . (4.9) 

a(u)^ 

From Section 3.3 we know that the trace identities yield symmetries for any choice of the value u* 
of the spectral parameter; let us see whether there are any particularly convenient choices. We 
parametrize vertex weights by (3.24) with p = 1, 

a{u) = sinh(u -|- iy) , h{u) = sinh(u) , c = sinh(i 7 ) = isiny . (4.10) 

Observe that h{u) vanishes at u* = 0, while a(u*) = c. At this point the Lax operator takes a 
particularly simple form: 

Lal{u*) = C PaZ , (4.11) 

where the permutation operator (braiding) is defined as 


H. :=i 




PaZ 



•— 2 ^ 0.1 £ End(14 <8) Vz) . 


(4.12) 


Exercise 4-4- To justify this graphical notation, check that the permutation operator switches 
vectors: Fai\P,ct) = \a,(3) for (basis) vectors \(4,a) = |/3)®|a) G Va®Vi. Show that tr^PaZ = Iz 
both algebraically and using the graphical notation. 

Conserved charges. Now we are all set to compute the first few symmetries using (4.9). 
Hq = logt(M*) is easy to find. By (4.8) and (4.11) we have t{u^) = tTa(PaL ■ ■ ■Ea 2 Eai)- 
Focus on the product of permutation operators. It is quite standard to rearrange such products 
by repeated application of the rule PaZcPaZ = PazPfcZj see e.g. Faddeev [3]. For a slightly more 
slick way to do this we exploit the relations in the permutation group by introducing for each 
permutation vr G Sl+i an operator Pj^ G End(V'a(8)Lf) switching vectors in Va^EL = Va® iu 

the way specified by tt. For example, from the transposition (al) G Sl+i we recover P(ai) = PaZ- 
(The above rule now is clear from PaZcPaZ = P(aZfc) = PazPfcZ-) We calculate 

f(n*) = C^tra{E(al 2 -L)) = C-^tr„(PaiP(i 2 ...L)) = C-^tra(Pai) P(i 2 ...L) = U~^ , (4.13) 

where we have used tr^PaZ = Iz and recognized the cyclic permutation operator P(i 2 ...i,) as the 
inverse of the shift operator U = e''^ G End('H). Thus 

is the momentum operator of the XXZ spin chain! 
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Exercise 4-5. Check that Xi = P^r ^ 7 r(i) any Xi G End(Vi) C End(Pa(8)^), i = a, 1, • • •, L, 
by applying both sides to any arbitrary (basis) vector |/3, ai, • • •, a^) = \ j3) ® la) G Va^T-L. 

Exercise 4-6- To find Hi it is convenient to set Lai{u) ■= PaiLai{u). Apply the result of 
Exercise 4.5 to Xai = L'^iiu) to check that Next compare 

L'i_ii{u^) for (4.10) with (2.10). Finally use (4.9) to show that Hi is nothing but the xxz 
Hamiltonian in disguise: 

Hi = it{u,)-H'{u,) - iL 1 = — (i/xxz - i?o 1) , (4.15) 

a[u^) smy 

where Eq is the vacuum energy (2.19). (Note that Hi's eigenvalues are proportional to the £m-) 
The higher H^ can in principle be computed in a similar fashion. The result is a sum of 
more and more nonlocal operators: H 2 consists of terms involving next-to-nearest neighbour 
interactions, see [2, Ex. 2.7], and so on. 

We conclude that the one-parameter family of commuting transfer matrices t{u) of the six- 
vertex model contain important observables of the xxz spin chain. The trace identities in fact 
provide a concrete relation between the two sides, connecting physical properties of the xxz 
model, such as the momentum and energy, to the eigenvalues of the transfer matrix, determining 
the partition function. Together with an equation relating the correlation functions of the two 
models (see e.g. [5, § 2.3]), this establishes the precise correspondence between the models from 
Sections 2 and 3. 

4.2 The Yang-Baxter algebra 

In Section 3.3 we found, for fixed A, a family of commuting operators t{u) G End(77) that 
generate symmetries Hk rendering the xxz and six-vertex models quantum integrable. In this 
subsection we get to the heart of the QISM starting from such commuting transfer matrices. 
It turns out that there is a sufficient ‘local’ condition: the fundamental commutation relations 
(for). These relations are closely related to the Yang-Baxter equation (ybe) for the R-matrix. 

Monodromy matrix. Rather than directly imposing horizontal periodicity to obtain the 
transfer matrix it is useful to define the ‘global’ monodromy matrix oxiVa®H as an ordered 
product of Lax operators: 

Ta{u) := Lal{u) := Lahiu) ••• La2{u)Lal{u) 

IG'^l 

1 1 . 

= a, -■■■- > G End(14 (g) 7^) . 

1 2 L 

The harpoon in ‘]]][’ points in the direction of increasing 1; notice that this order of the Lax oper¬ 
ators is consistent with the order indicated by the little arrows in our graphical notation. (Also 
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note that, as always, subscripts corresponding to the ‘global’ space % are omitted in the tensor- 
leg notation.) The transfer matrix t{u) = iTaTa{u) arises as a trace of the monodromy matrix 
over the auxiliary space, corresponding to horizontal periodic boundary conditions, cf. (3.9). 

Like in (4.3) one should notice that the order of the labels <5 and 7 in the coefficients of the 
‘outgoing’ vectors are reversed in the graphical notation: 


/ 


Ta{u)\l3,OL) = 


E E 

<5e{±i}7e{±i}^ 


7i 72 


/3 


V 


Oil 02 


IL 


OtL 


1 ^, 7 ) 


(4.17) 


7?TT-relation. For graphical computations it is often convenient to depict vectors in the 
global Hilbert space (3.8) simply by a single ‘fat’ arrow, which we indicate by a triple line. This 
leads to following graphical shorthand: 


4k 


TJu) = 


1---L 


4 k 


t{u) = 


l -L 


(4.18) 


The commutativity (3.23) of t{u) and t{v) can then be depicted, cf. (3.25), as 


r 


7 

r 



1- 


■L 


c 


c 

7 




l -L 


V 

u 


(4.19) 


Now consider two copies of the auxiliary space, 14 and I 4 , with spectral parameters u and v. 
Of course (4.19) holds if Ta{u) and Ti,{v) would commute in End(14 



K 






l -L 



V 






1---L 


(4.20) 


but a direct check (even for L = 1, in which case Ta is just the Lax operator) shows that this 
is not true for generic values of u and v. 

Exploiting the horizontal periodicity, however, we can write down another equation that 
is not too restrictive while still guaranteeing (4.19): the FOR. For this we need an operator 
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Rah{w) £ End(14 ® Vf,), rather unimaginatively called the R-matrix. This operator is allowed 
to depend on some spectral parameter w, and should be invertible for generic values of w. We 
depict the i 2 -matrix and its inverse as 


Rab{w) = 

b^ ’ 

KbH = , 

a / ^ 


(4.21) 

SO that the products 





a ^ a 

‘ ^ab 5 


:= Ifea 

(4.22) 







are the identity operators, while the square of either operator in (4.21) is not. (Since u and v 
are associated to 14 and 14 respectively, cf. (4.19), one may hope to be able to express w in 
terms of u and v, this will indeed be the case.) As for the Lax operator and the monodromy 
matrix, the order of the ‘incoming’ and ‘outgoing’ labels is reversed for the coefficients in the 
graphical notation: 


= E ,„X« 

These coefficients have to be determined. 

The use of the ii-matrix comes from two theorems that give ‘global’ and ‘local’ conditions 
on the i?-matrix guaranteeing (4.19). We start with the ‘global’ theorem: 

Theorem 1. If there exists an R-matrix Rah{w) G End(14 ® Vb) which 

i) is generically invertible; 

a) satisfies the following ‘global’ FOR in End(14 ® Vb ®R): 

Rab{w) Ta{u) Tb{v) = Tb{v) Ta{u) Rab{w) , (4.24) 

then the transfer matrices t{u) and t{v) commute. 

Proof. Keeping track of the order in which the operators act, (4.24) can be depicted as 


(4.23) 




(4.25) 
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By multiplying both sides in (4.25) from the left by the inverse of the i?-matrix we obtain the 
equivalent relation 



b 

a 



V 






1---L 


(4.26) 


Taking the trace over both auxiliary spaces we conclude that t{u) and t{v) commute using the 
cyclic property of the trace. □ 

Thus we ask for the monodromy matrices Ta{u) and Tb{v) to commute up to conjugation 
by the i2-matrix (in more algebraic terms: we want the i2-matrix to intertwine the actions of 
the two monodromies). Equation (4.24) is often referred to as the RTT-relation for obvious 
reasons. It is global in the sense that it involves operators Tq and acting on the ‘global’ 
Hilbert space %. Since Ta{u)Tb{v) = (Ta{u) 0 l)(l0 T;,(u)) = Ta{u) ®Tb{v), (4.24) can 
rewritten as 

Rab{w) {T{u) 0 T{v))ab = {T{v) 0 T{u))ab Rab{w) ■ (4.27) 

Before we continue let us simplify our graphical notation a bit. In (4.21) we used under- and 
overcrossings to distinguish between the ii-matrix and its inverse, both of which were necessary 
for the above proof. However we have no further need to depict from now on. Thus we 

may drop this inverse from our graphical notation, and update (4.21) to the simpler rule 


Rab{w) = . (4.28) 

In this notation the i?-matrix still differs from the Lax operator by the labels of the lines. 

The task of finding a suitable i?-matrix is simplified by the following ‘local’ version of 
Theorem 1. 

Theorem 2. If there exists an R-matrix Rab{w) G End(I4 0 14) which 
i) is generically invertible; 

a) satisfies for any (and hence all) I G Z/, the following ‘local’ FOR in End(I4 0 14 0 Vi): 

Rabiw) Lai{u) Lbfiv) = Lbiiv) Lafiu) Rab{w) , (4.29) 

then the transfer matrices t{u) and t{v) commute. 

Proof. By Theorem 1 it suffices to show that the i?TT-relation (4.24) is equivalent to (4.29). 
Since the latter is obtained from (4.24) in the special case with only one lattice site {L = 1), 
which we may always label by I, the i?TT-relation implies the local FOR. To see that (4.29) is 
also sufficient we use the following graphical (yet rigorous!) ‘train argument’. 
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Suppose that (4.29) holds, so that Lax operators acting in the same local physical space, 
but different auxiliary spaces, commute up to conjugation by the i?-matrix. Diagrammatically 
(4.29) says that the vertical line, corresponding to Lax operators acting on two vertices above 
each other, can be moved through the crossing representing the i?-matrix: 


a 

b 




(4.30) 


From the definition (4.16) of the monodromy matrix, L applications of (4.30) do the job: 





12 L 


□ 


In statistical mechanics (4.29) is often referred to as the star-triangle relation.^^ If a model 
admits an i?-matrix that satisfies conditions (i)-(ii) one can construct symmetries from the 
transfer matrix as described in Section 4.1. Such models are moreover solvable via techniques like 
the algebraic Bethe ansatz, see Section 4.3. For this reason such an i?-matrix is the integrahility 
datum allowing one to study quantum-integrable models from an algebraic point of view. Indeed, 
in practice one often uses this structure to define ‘quantum integrability’. 


i?-matrix. The upshot of the preceding discussion is that if we can find an ii-matrix sat¬ 
isfying the FOR (4.29) for a given Lax operator then the transfer matrices constructed from 
that Lax operator commute. In Appendix C the FOR of the xxz/six-vertex model, with Lax 
operator (4.2), is solved for a nontrivial i?-matrix that respects the symmetries of the model: 
it satisfies both line conservation (the ice rule) and spin-reversal symmetry. The result is of the 

®We reserve the name ‘Yang-Baxter equation’ for the rather similar and intimately related (but algebraically 
still more fundamental) equation that we will encounter soon. 
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same form as the Lax operator (4.5): 


Rab{w) 


/a{w) 

b{w) c(w) 
c(iu) b(w) 

V 


\ 


(4.31) 


where the functions a, b and c were defined in (4.10). Accordingly, the entries of the i?-matrix 
may be interpreted as the vertex weights of another six-vertex model with the same value 
of A = cosy but with different spectral parameter w. 

Clearly (4.31) is indeed invertible for almost all values of the spectral parameter. In Ap¬ 
pendix C we further show that this i?-matrix solves the FOR provided the spectral parameters 
are related by the difference property w = u — v. (Of course one can also directly check that in 
this case (4.31) does indeed satisfy the FCR, see e.g. [3, §10].) Due to the difference property 
the FCR is often written as 


Rah{u - v) Lai{u) Lu{v) = Li,i{v) Lai{u) Rab{u - v) , (4.32) 

and likewise for the i?TT-relation. This result nicely fits in the graphical notation if we 
straighten out the lines in (4.30): 




(4.33) 


Here the spectral parameters of the operators are included as angles, and the FCR says that any 
single line may be shifted past the intersection point of the other two lines if it is kept parallel 
to the original line. (Note that such shifts are in fact used in Section 5.1 to derive the ybe for 
factorized scattering.) 


Yang-Baxter algebra. The following algebraic construction lies at the core of the QISM and 
provides the mathematical setting for the computations in the framework of the algebraic Bethe 
ansatz, as we will see in Section 4.3. Since the monodromy matrix Ta{u) G End(I4 (8> R) also 
acts in auxiliary space we can write it as a matrix on 14 , 


Ta{u) 


(A{u) B{u)\ 
\C{u) D{u))j 


whose entries act on the physical space R of the spin chain. 


(4.34) 
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Exercise ^.7. Check that in our graphical notation 


A{u) 


C{u) 




l-L 


A 


l -L 


B{u) 


D{u) 


A 



(4.35) 


These (one-parameter families of) operators in End('H) generate a (unital, associative) al¬ 
gebra, known as the Yang-Baxter algebra (yba), whose commutation rules are given by the 
i?TT-relation (4.24) with w = u—v. The latter encodes 2^ x 2^ = 16 relations in End(14(8) 14) for 
the generators (4.35). The explicit form of these relations can be found from (4.27) by straight¬ 
forward matrix multiplication, see [3, §4]. Instead one can also use the graphical form (4.25) of 
the i^TT-relation to find these relations. For example, the (l,4)-entry of (4.27) corresponds to 


A 






(4.36) 


1---L 


l -L 


implying that 


B{u)B{v) = B{v)B{u) . 


(4.37) 


Likewise, paying attention to the different ordering of ‘incoming’ and ‘outgoing’ auxiliary vec¬ 
tors, the (1,3)- and (3,4)-entries of (4.27) correspond to 


itK 






+ V 



(4.38) 


(4.39) 
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Upon interchanging tt o u in (4.38) these yield more complicated commutation rules: 

"<"> - RiHHl 

In both of these relations the first term on the right-hand side just contains the commuted 
operators (up to a factor), whereas in the second term the two operators have in addition 
interchanged their spectral parameters. 

The physical use of the YBA stems from the quantum inverse-scattering problem, which 
asks whether it is possible to reconstruct arbitrary operators in End(V;), and thus those in 
End(?^), from Ta{u). The solution to this problem was found for many models, including the 
xxz/six-vertex model, in [26]. The conclusion is that A(u), ■ ■ ■, D{u) generate all of End('H). 
For example, the transfer matrix is an element of the Yang-Baxter algebra: 


t{u) = 


J 

K 







A 


+ 


= A{u) -\- D{u) 


(4.42) 


l-L 


1---L 


Exercise 4-8. Find relations like (4.36) for A and for D. Next use (4.25) to compute [A{u), D(v)]. 
Check in this way that t{u) and t{v) do indeed commute. 


Yang-Baxter equation. The entries of the i2-matrix play the role of structure constants for 
the Yang-Baxter algebra. In the present context there also is an analogue of the Jacobi identity 
for these ‘structure constants’. Indeed, consider one more copy of the auxiliary space, Vc, with 
associated spectral parameter w. The iiTT-relation can be used to reverse the order in the 
product Ta{u) Tb{v) Tc{w) to get Tc{w) Tf,{v) Ta{u) up to conjugation by products of i2-matrices. 
Now there are two ways in which this can be done, corresponding to the two decompositions 
(13) = (12)(23)(12) = (23)(12)(23) of the permutation switching the first and third monodromy 
matrix. To avoid 2^ x 2^ = 64 additional relations for the Yang-Baxter algebra, the two results 
must coincide. This is true when the ii-matrix satisfies the famous Yang-Baxter equation (ybe) 
in End(Ua (8) 14 (8) 14 ): 

Rab{u - v) Rac{u - w) Rbc{v - w) = Rbc{v - w) Rac{u - w) Rab{u - v) . (4.43) 

Like the Jacobi equation, this relation is cubic in the ‘structure constants’. One can check that 
the solution (4.31) of the six-vertex FCR does indeed satisfy the ybe. In our graphical notation 
(4.43) becomes 
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Readers familiar with the braid group may recognize this as an analogue of the braid relation 
but involving spectral parameters. Of course the lines may again be straightened out like we 
did in (4.33). 

Summary. To conclude this subsection we present a brief overview of the formalism that we 
have set up. The operators of the QISM, the equations that they satisfy, and relation between 
these operators is shown in Table 2. Any physical operator, in End(?^), can be expressed as an 
element of the YBA, i.e. in terms of the generators A(u), • • •, D{u). In particular the YBA can 
be used to construct the Bethe vectors, which is our next topic. 



auxiliary 14 

local Vi 

global % 

auxiliary 14 

physical result 

Rab{u) : YBE 

Lai{u) : FCR ^ 

-> Ta{u) 

A(u),---,J 
t{u) : c( 

RTT 

' 

D{u) : YBA 
rmmute 


Table 2: Summary of the QiSM in the spin-chain language, where 14 plays an auxiliary role. 


4.3 The algebraic Bethe ansatz 

Our final task is to reproduce the results of the CBA from Sections 2.3 and 3.3 in the context 
of the QISM. The goal is to diagonalize the transfer matrix (4.42); the spectrum of the xxz 
Hamiltonian (4.15) then follows from the trace identity (4.15). We proceed along the lines 
of Faddeev [3]. Although there are still some nontrivial calculations involved in the algebraic 
Bethe ansatz (aba), it is much easier to get the eigenvalues Km and the bae for any M-particle 
sector than it is with the CBA. 


Second quantization. The CBA from Section 2.2 features the Bethe wave function (2.29): 
this is the first-quantized approach to the quantum-mechanical spin chains. In contrast, the 
ABA corresponds to second quantization through the explicit construction of a Fock space of 
states for the model. As a first attempt to construct such a Fock space let us briefly go back to 
the CBA. We already have a good candidate for the Fock vacuum: the pseudovacuum |H) G T-Lq 
from (2.13). For M = 1, (2.20) suggests that serve as a creation operator for 

a magnon with quasimomentum p^- Unfortunately already for M = 2 we see that this cannot 
be true. Indeed, applying that operator twice on |H) gives A{pi,p 2 ) = A\pi,p 2 ) in (2.26), only 
allowing for trivial two-body scattering. To proceed we have to exploit the Yang-Baxter algebra 
from the previous subsection. 

Again we start from the pseudovacuum, which is depicted in our shorthand as 


|H) = 



(4.45) 


43 











In view of (4.17) and (4.35) we have 


7 


.4(a) |n>= 

76{±l}i 


It) 


\n) = aiu)^\n) , (4.46) 


where in the second equality the sum over outgoing configurations collapses to a single term by 
line conservation. 

Exercise 4-9. Show in the same way that |fl) is also an eigenvector of D{u) and C{u), 

C{u) |fl) = 0 , D{u) |0) = b{u)^ |Sd) , (4.47) 

while 

= (4,48) 

Thus B{u) and C{u) present themselves as candidates for raising and lowering operators, 
respectively. For this to make sense B{u) must map Bm into Bm+i for each M-particle sector, 
while C{u) should act in the opposite direction. Graphically it is obvious that this is indeed 
the case: by line conservation B{u) ‘injects’ an excitation (occupancy) into the global quantum 
space B, while C{u) ‘absorbs’ one. We conclude that B{u) and C{u) may indeed be used to 
build a Fock space starting from | fl). 

Exercise 4-10. For an alternative argument check that the ice rule [S^ + ,Ta{u)] = 0, cf. (4.6), 

implies that the generators of the yba satisfy 

[S^,A{u)] = [S^Diu)]=0, (4.49) 

[5^ B{u)] = -B{u) , [5^ C{u)] = C{u) , (4.50) 

and compare (4.50) with (2.4). 

For this construction to reproduce the results of the CBA we also need a way to include the 
parameters pm = —ilog^m- In the present set-up there is already an obvious candidate to fulfil 
this role: the spectral parameter u. If B{u) is to create a physical state we should in particular 
be able to match (4.48) with the magnon-solution (2.20) to reproduce the spectrum for M = 1. 
This requires 

ziu) = e'PW = , (4.51) 

a[uj 

which is consistent with Exercise 3.11 in Section 3.3. 


Algebraic Bethe ansatz. According to (4.49) the Hilbert space B splits into M-particle 
sectors as in (2.17), where each Bm is preserved by the transfer matrix (4.42). Motivated by 
the preceding discussion, for suitable values of the spectral parameters u G C^, let us look for 
eigenvectors in the M-particle sector of the form 

\^M',u) := B{ui) ■ ■ ■ B{um)\^) €Bm- (4-52) 
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This is the algebraic Bethe ansatz for the Bethe vectors employed to diagonalize the transfer 
matrix and spin-chain Hamiltonian. The strategy is as follows: 

1. Use (4.42) and the relations from the Yang-Baxter algebra to work out t{uo) \^m]u). 

2. Read off Am{z) from the wanted terms, proportional to \^m',u) as in the ABA. 

3. Demand that the unwanted terms cancel to get the bae for the allowed values of u. 

Like for the CBA, the ansatz (4.52) will only work for specific values of u, but unlike before 
there are no unknown coefficients that have to be determined. Thus, this time all effort goes 
into Step 1, which can be done using a nice trick based on (4.37). 

Step 1. We have to compute the two terms in 

M M 

t{uo)\'^M',u) = A{uo) Y\ Bium)\^) + D{uo) R(rtm) |f^) • (4.53) 

m=l m=l 

We start with the first term on the right-hand side. Using (4.40) we can move A(uo) past B{ui): 

A{uo) B{um) = B{ui) A{uo) - B{uo) A(ni)^ B{um) ■ (4.54) 

Continuing in this way we obtain 2^ terms, each proportional to ( Ylu^n some 

0 < H < M. As |D) is an eigenvector of A{u^), see (4.46), the result must be of the form 

M M 

A{uo) \^M-,u) = ^ M^{uq, u) B{uu) |D) . (4.55) 

/i=0 u=0 

Two of the coefficients are easy to compute. Firstly, only one of the 2^ terms contributes 
to = 0: this is the term where we always pick up the first term in (4.40), giving 

Mo{uo,u) = a{uo)^ ^ . (4.56) 

b[u„, - uo) 

Secondly, the coefficient for /U = 1 also only has one contribution: this comes from the second 
term on the right-hand side of (4.54), where we always pick up the first term in the subsequent 
steps of (4.40). Thus we find 


Mi{uo,u) 


-a{ui)^ 


c{ui - Up) 
b(ui - Uo) 


n 


a(Un 

b(Un 


Up) 

Up) 


(4.57) 


The other coefficients receive more and more contributions, and their calculation appears to 
be a complicated task. Luckily there is a neat trick that exploits the yba to obtain the other 
coefficients without much effort. Indeed, recall that by (4.37) the B’s commute. (We actually 
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already used this in writing an ordinary product in the ABA; else we should have specified an 
ordering.) Therefore we may rearrange the creation operators in (4.54) in any way we like; 
in particular we may put B{um) in front. Then, by switching 1 and m in (4.57), the above 
argument immediately yields 


, , / X ( \L ^o) TT n(^n Uva) 

Mm{uo,u) = -a{Um) 77 - 7 77 - 7 - 

b{Um - Uo) b{Un “ Itm) 

n^m 


The coefficients Nfj^{uQ,u) in 


M M 

D{uq) \^m;u) = '^Nf,{uo,u) B{u,y) | 0 ) 
1 - 1=0 


u=0 


(4.58) 


(4.59) 


are computed in a similar way, now using relation (4.41) from the YBA together with (4.47) and 
of course the trick. The result is 


No{uo,u) = b{uo)^ ^ , 

b{uo - Urn) 
m=l ^ ' 


M 


AT- / \ I / \L *'(^0 Urr^ -p-r a{Um Un) 

Nm{Uo,u) = -b{Um) 77 - 7 M 77 - 7 

5(no - Urn) b{Um - Un) 

n^m 


(4.60) 

(4.61) 


Step 2. Since only the terms with /r = 0 in (4.55) and (4.59) are of the wanted form, the 
eigenvalues are given by 

A„(u„; u) = f) f] ~ . ( 4 . 62 ) 

b{Um - uo) b{uo - Um) 

Step 3. The remaining terms in (4.55) and (4.59) cancel when Mm{uo, u) + Nm{uo, u) = 0 
for all 1 < m < M, that is, when 


b{u 


mj \ c{Um Uq) b(uo Um) "TT u(Un Um) b(^Um Un) 


^(Um) 


n 


b{Um Uq) c(uq Um) ^(Un Um) b(Um Un) 

n^m 


1 < m < M . (4.63) 


Results. Notice that (4.62)-(4.63) have the same form as Am and the bae found in Sec¬ 
tion 3.3. Moreover, the left-hand side of (4.63) matches with that in (2.38) and in (3.19) 
when (4.51) holds. To make contact with the results obtained through the CBA we use the 
parametrization (4.10). 

Exercise 4-11. Using some trigonometric identities, show that (4.62) precisely matches with 
(3.16) if we recognize a{uo) = a, b{uo) = b, c{uq) = c and b{um)/a{um) = Zm- 
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Exercise 4-12. Check that (4.63) now reduces to 



n^m 


(4.64) 


and that this correctly reproduces (2.45) when the spectral parameters are identified with 
rapidities via 

Um = -(Am + i7/2) • (4.65) 

Now let us use the trace identities to compute the momentum and energy of the Bethe 
vectors. Notice that the second term in (4.62), and almost all of its derivatives, vanish at u* = 0. 
By (4.14) the momentum of the Bethe vector (4.52) is 


p{u) 


ilog 


Km{u^,u) 

a{u^)^ 


M 

= i X] 

m=l 


b{Um) 


M 

^ P{Um) , 
m=l 


(4.66) 


nicely generalizing (4.51) to the M-particle sector. 

Exercise 4-13. Use (4.15) to check that the energy of \'^m',u) is given by 


isiny x-i ^ 

£:m[u) = —— Am[u^,u) -— 

Z CfUQ 


M 


ei{u) = 


isiny b{u) (a{u) 


a{u) \ h{u) 


UQ=U^ 

sin 7 


Am{uo,u) = ^ ei(um) , 
p{u) , 


m=l 


(4.67) 


and plug in (4.65) to check that this agrees with (2.44). 

In the framework of the QISM it did not require much effort to derive these results even for 
an arbitrary M-particle sector. A comparison with the amount of work needed to obtain the 
same results using the CBA in Appendix B goes a long way to justify the abstract algebraic 
machinery developed in the previous subsections! 


Rational limit. Let us briefly turn to the isotropic limit A — )• 1. Notice that the paramet- 
rization (3.24) yields a = b and c = 0 as y —)• 0. Thus the Lax operator (4.5) reduces to the 
trivial operator psinh(M)l in this limit. This is directly related to the issue pointed out in 
Exercise 2.20 at the end of Section 2.3. To study the isotropic limit one can use the paramet- 
rization obtained from (3.24) by rescaling rt = yu' and p = 1/y before taking y —?■ 0. Dropping 
the primes we find that the result is rational in u, and in our case even linear; 

a{u) = u + i , b{u) = u , c{u) = i . (4.68) 

The Lax operator (4.7) thus becomes a simple linear combination of the identity operator and 
the permutation operator (4.12): 


Lal{u) = ulal+l^al ■ 


(4.69) 
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The ii-matrix (4.31) acquires the same form in the isotropic limit. 

From the XXX YBA it can be shown that on-shell Bethe vectors are always highest weight: 
they are annihilated by the total spin-raising operator, = 0 , by virtue of the bae; 

see e.g. [3, §4]. The su(2)-descendants in the spectrum are obtained by applying S~ to the 
Bethe vectors. The xxx spin chain is analyzed using the QISM in [27, §3]. 

Exercise 4-M- Check that for Rab like in (4.69) the i2TT-relations (4.27) can be written as 

[u - v) [Tij{u),Tki{v)] = i {Tkj{v)Tii{u) - Tkj{u)Tii{v)) (4.70) 

where Tii(u) = A{u), Ti 2 {u) = B{u), T 2 i{u) = C{u), T 22 {u) = D{u). 


More spin chains. To conclude this section we show how the QiSM allows one to define new 
quantum-integrable spin chains. In Section 2.1 we looked at spin chains whose interactions are 

i) only nearest neighbour; 

ii) homogeneous (translationally invariant); and 
hi) at least partially isotropic. 

The xxx and xxz magnets are the main examples of such models. Let us briefly recall where 
properties (i)-(iii) were used in the analysis of these spin chains. Property (iii) was necessary 
to define the M-particle sectors, forming the starting point for both the CBA and the ABA. For 
the diagonalization of the Hamiltonian in the one-particle sector property (ii) came in handy, 
directly yielding the magnons (2.20). Property (i) was also important for the CBA, leading to 
the Bethe wave function (2.29). 

In the present section we have seen how, starting from the Lax operator (4.2) for the xxz 
model, via the monodromy matrix (4.16) one obtains the Yang-Baxter algebra that allows one 
to solve the model via the ABA. The latter still crucially depends on (iii), but it is possible to 
relax the other two properties in such a way that we can still use the ABA to solve the resulting 
models. 

Twisted boundaries. The periodic boundary conditions can be modified (‘deformed’) to 
allow for quasi-periodic or twisted boundary conditions, Si+l = exp(4i'dcr^) Si exp(—^i^?, 
where the twist parameter is 27r-periodic, cf. exp(±7ricj^) = — 1. Such boundary conditions 
are accounted for in the QISM by introducing a twist operator 

Kai'd) := exp(4ii?fj^) = diag(e^’’/ 2 ^ ^ „-,-, ^ End(K) ■ (4.71) 

Although this operator breaks the full isotropy group when one starts with the xxx spin chain, 
the partial isotropy subgroup U{l)z C SU{2) corresponding to the ice rule, cf. (4.6), is preserved: 

[Ka{'&)Kb{'&),Rab{u)] = [exp 5 ii?(fT^ -k al),Rab{u)] = 0 . (4.72) 

This implies that the twisted monodromy matrix 


r,(n;i?) :=Kaid) Lai{u) 


A 


E End(K ® U) 


(4.73) 
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satisfies the i?TT-relation for the same i?-matrix (4.31), so one can use the ABA to diagonalize 
the twisted transfer matrix t{u\i9) = tra Ta{u] -d) = D{u). 

Exercise 4-15. Extend the results from Sections 4.1, 4.2 and 4.3 to the case of twisted boundary 
conditions: compute Hq and i^i, use (4.72) to verify (4.25), check whether the relations (4.37) 
and (4.40)-(4.41) of the yba are modified, and compute the eigenvalues and bae for the Bethe 
vectors (4.52). 

Inhomogeneities. Translational invariance can be broken by considering Lax operators 
Lai{u] Hi) := Laiiu — Hi) that depend on inhomogeneity parameters Hi ^ C. Integrability is 
preserved since the shifted arguments do not affect the FOR (4.32) in an essential way, and the 
same i?-matrix (4.31) does the job. As the shifts generically differ from site to site, however, this 
time there is no value n* of the spectral parameter at which all Lax operators become propor¬ 
tional to the permutation operator as in (4.11), and the Hk cannot be expressed in a nice way; 
in particular the Hamiltonian Hi does no longer involve only nearest-neighbour interactions. 
Nevertheless, one can still define the monodromy matrix as 

Taiu]ti) ■= n Lai{u- Hi) , (4.74) 

and proceed as before to diagonalize t{u] fx) = tr^ Ta{u; fi) = A{u] /i) -|- D{u; /i). 

Exercise 4-16. Extend the results from Section 4.2 and 4.3 to the inhomogeneous xxz spin 
chain: check if the relevant relations of the YBA are altered, find the vacuum eigenvalues (4.46) 
and (4.47), and compute the eigenvalues and bae for the Bethe vectors (4.52). 

Further generalizations. Other quantum-integrable spin chains that can be tackled using 
the QISM include models with higher spins (V) = see e.g. [2, §2.4, 3.4] or [3, §10]), open 

spin chains with reflecting boundaries (see e.g. [2, §3.5] or [28]), local spins that vary from site 
to site (obtained by ‘fusion’), and even exotic ‘spin’ where su(2) is replaced by any simple Lie 
(super) algebra [29]. 

5 Relation to theoretical high-energy physics 

In Section 2, 3 and 4 we have dealt with quantum integrability in the context of quantum 
and statistical mechanics. In this section we turn to QFT. The following (possibly biased and 
certainly incomplete) overview of applications of quantum integrability to QFT may serve as a 
motivation for studying quantum integrability for the more ‘hep-th’-oriented reader: 

• Exact S-matrix theory in 2d QFT is governed by a Yang-Baxter equation (ybe) for the 
two-body S'-matrix (Zamolodchikov-Zamolodchikov, 1979 [30]). See Section 5.1 and the 
nice lecture notes by Dorey [31]. 

• Also in two dimensions, conformal field theories (with central charge c < 1) possess a 
quantum-integrable structure (Bazhanov-Lukyanov-Zamolodchikov, 1990s [32]). 

• The high-energy scattering of four-dimensional Yang-Mills theories, such as QCD, ap¬ 
pears to exhibit hidden symmetries governed by quantum-integrable spin chains (Eaddeev- 
Korchemsky, 1995 [33]). 
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Moving up a notch and adding supersymmetry: 

• The gauge/ YBE correspondence interprets Seiberg duality for 4d AA = 1 quiver gauge the¬ 
ories as a Yang-Baxter relation for vertex models on those quivers (Yamazaki, 2013 [34]); 

• The Bethe/gauge correspondence relates supersymmetric vacua of certain M = 2 gauge 
theories to quantum-integrable models (Nekrasov-Shatashvili, 2009 [35-38]). This topic 
is introduced in Section 5.2. 

• The AdS/CFT correspondence has led to what is currently the largest and most active 
research area relating quantum integrability to high-energy physics. The case that is 
understood best is based on the following two ingredients: 

i) It is well known that the IIB superstring in the AdS^ x background is classically 
integrable. The one-loop corrected two-body S'-matrix satisfies a ybe, providing 
strong evidence that the theory is integrable at the quantum level too. 

ii) Correlators of gauge-invariant operators in planar 4d AA = 4 Yang-Mills theory can 
be computed using spin chains. 

Crucially, these two appear to yield exactly the same quantum-integrable structure, and 
there is a quantum-integrable model interpolating between the two sides, as is indicated 
by several nontrivial tests. See the big review [39], and [40] for a detailed account of the 
string-theory side. 

Let us also mention few more ‘math-ph’-oriented connections between quantum integrability 
and gauge theories: 

• Certain deformations of 2d Yang-Mills theory are related to exactly solvable statistical- 
physical models (Migdal, 1975; Rusakov, 1990; Witten, 1991). See the review [41] and 
references therein. 

• Chern-Simons theory and other 3d topological QFTs have quantum-group symmetries 
(Witten, 1989; Reshetikhin-Turaev, early 90s). See the review [42] and references therein. 

• Twisted deformed 4d AA = 1 gauge theories are in a similar way related to quantum- 
integrable models (Costello, 2013 [43]). 

Thus, quantum integrability also appears to have close ties to theoretical high-energy and 
mathematical physics. To get some feeling for how quantum integrability may appear in such 
contexts we turn to two examples: one old — the theory of exact S'-matrices and factorized 
scattering in two-dimensional QFT, which also sheds more light on the results from Sections 2.3 
and 3.3 — and one much more recent: the Bethe/gauge correspondence. 

5.1 Quantum integrability and 2d QFT 

Physics in two dimensions is special. This is well-known for conformal field theory, but more 
generally applies to two-dimensional QFT. For example, the spin-statistics connection no longer 
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holds in lower dimensions, in accordance with the results for magnons on a spin chain in Sec¬ 
tion 2.3. In condensed-matter physics this also shows up in two spatial dimensions: the rotation 
group 5'0(2) is abelian, so spin is not restricted to take half-integer values in the quantum the¬ 
ory, and at the same time there is a whole range of possible (‘anyonic’ or ‘braid’) statistics. 
The situation for statistics in one spatial dimension is even more peculiar, as is illustrated by 
bosonization.’^® In the relativistic setting of high-energy physics in spacetime dimension two, 
the little group ^©(l) C 50(1,1) is trivial, so (Lorentz) spin in does not even have an intrinsic 
meaning in 1 -|- 1 dimensions. 

Another notable feature of two-dimensional physics that is more relevant for us is, of course, 
the presence of quantum-integrable models. However, there is in fact no generally accepted 
definition of ‘quantum integrability’, so let us pause for a moment to think what this could 
actually mean. 

Motivated by the definition of Liouville integrability in classical mechanics (see below) one 
often would like to ask for a maximal set of commuting conserved quantities, such as the Hk 
in Section 3.3. However, for quantum-mechanical models with a finite-dimensional Hilbert 
space such a family always exists: like for any hermitean operator, the eigenstates \^k) of the 
Hamiltonian can be taken to be orthogonal; then Hk ■= |\I'fc) ® (T/jI does the job. In addition 
there is an issue with the number of independent operators, and hence with the notion of 
‘maximality’, see [45]. 

In practice, then, one often demands the existence of an underlying i?-matrix satisfying the 
YBE, see Section 4.2. This equation is closely related to some of the main results from Sections 
2.3 and 3.3: 

i) the xxz and six-vertex models have a tower of commuting symmetries; 

ii) the number M of magnons/occupancies is conserved in scattering processes; 

hi) such scattering is two-body reducible: it factorizes into two-body processes. 

Indeed, the discussion in Section 3.3 and 4.2 shows how the ybe is related to (i). In this 
subsection we explain how the YBE may appear in qfts with many conserved quantities in two 
dimensions, and how results (ii) and (iii) fit in. 

Classical integrability. Let us first summarize the situation in classical mechanics, where 
there is a well-defined notion of integrability. In the Hamiltonian framework classical mechanical 
models are described in terms of a finite-dimensional phase space, say with coordinates qi (posi¬ 
tions) and Pi (momenta), and Poisson brackets { •, • } between (functions of) those coordinates. 
The time evolution of any observable /(p, q) is determined by the Hamiltonian H{p, q) through 
Hamilton’s equation f := dtf = {H, f}. A familiar example is a collection of point particles 
moving on a line in a potential U{q), for which H{p, q) = YliPi/‘^'^ + the equations 

{Pi, Qj} = kj yield qi = pilm and pi = -diU{q), giving Newton’s mqi = -diU{q). 

Now, in brief, if a classical mechanical system has a maximal set of independent conserved 
quantities, and if those quantities Poisson-commute with each other, then the system is solvable. 
Liouville’s theorem makes this statement precise and provides a recipe for finding the solution. 

®For bosonization and other techniques in many-body quantum physics in one spatial dimension we refer to 
the book by Giamarchi [44]. In particular, bosonization can also be applied to the xxz spin chain, see [44, §6.1]. 
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Systems that can be solved in this way are called Liouville integrable and include the harmonic 
oscillator, Kepler problem and several spinning tops, see [9, §2] or the classic [46, §49-50]. 

The next step is the notion of a Lax pair, which opens up the road to the beautiful theory of 
classical integrable models. This theory contains the (semi)classical version of concepts that play 
an important role in quantum integrability, notably the classical r-matrix and classical Yang- 
Baxter equation as well as spectral parameters. Unfortunately this takes us too far from our 
topic, but see e.g. [9]. We suffice by mentioning that this story also has an infinite-dimensional 
incarnation in the theory of integrable nonlinear PDEs in two dimensions, like the Korteweg-De 
Vries (KdV) equation, for which there exists an infinite set of conserved quantities that can 
be obtained from a generating function called the monodromy matrix. These systems have 
solitonic solutions that can be obtained via the so-called classical inverse-scattering method, 
whose quantization is the QISM from Section 4, see also [8, §V]. 

The main message to take away is that in classical mechanics the existence of sufficiently 
many conserved quantities signals its integrability. What happens if there exist many conserved 
quantities in a field theory? 

Symmetries of the S'-matrix. One of the main goals in quantum field theory is to compute 
scattering amplitudes. All such amplitudes are contained in the S-matrix, which relates asymp¬ 
totic incoming states to all possible asymptotic final states: schematically S\i) = X^jS/j/), 
where the S-matrix entries Sj depend on parameters such as the momenta of the asymptotic 
states. The S'-matrix is a very complicated object and ordinarily one can at best hope to cal¬ 
culate its entries perturbatively. The presence of symmetries (and thus conserved quantities), 
however, may restrict the S-matrix to such an extent that its entries can be computed exactly. 

In many held theories the momentum is conserved. In addition there may be ‘internal’ 
symmetry operators, which commute with the Lorentz algebra o{d — 1,1), such as havour or 
gauge symmetries. These operators are also symmetries of the S'-matrix. Can the S'-matrix 
have symmetries that are not Lorentz scalars or vectors? In three or more spacetime dimensions 
the Coleman-Mandula theorem says the answer is negative [47]. More precisely: if a Lorentz- 
invariant theory has a mass gap and its S-matrix is analytic, under some technical assumptions 
the presence of any ‘forbidden’ inhnitesimal (bosonic) symmetry operator forces the 5-matrix of 
the theory to be trivial. Clearly such theories, having no interactions, are not very interesting. 

In spacetime dimension two, though, the Coleman-Mandula theorem no longer holds, and 
there are interesting two-dimensional theories, such as the sine-Gordon model, exhibiting (in¬ 
finitely many!) conserved charges transforming in higher representations of the Lorentz al¬ 
gebra o(l, 1) = M. The existence of such higher symmetries in a field theory does, however, 
severely constrain the 5-matrix of the theory. There are two important restrictions; let us take 
a look at each of them. 

Elastic scattering. The first restriction arises in the following way. A local operator Qg of 
‘Lorentz spin ’s acts on asymptotic states by multiplication with a polynomial, homogeneous of 
degree s, Ca,s (Pa )* ™ the lightcone momenta {p^ = p^ Pp^) of the asymptotic particles a. 
(For s = 1 the coefficients Cap all equal one and this is just the total momentum.) Conservation 
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of Qs thus implies 


(5.1) 


^pfy = Y1 ^pfy ■ 

iSin /Sout 

If, for N ^ N' scattering, the number of Qs with different s is large enough we get an overde¬ 
termined system of equations in the N+N' asymptotic momenta, and the solutions are trivial up 
to relabelling: the sets of initial and final momenta must coincide, {pi \ i G in} = {pj | / G out}. 

Now if there exist infinitely many symmetries Qs then it follows that there is no macroscopic 
particle creation or annihilation in such theories: N = N'. (Microscopically there may be 
contributions due to virtual processes where the number of particles are not conserved.) Having 
infinitely many conserved charges thus constrains the 5-matrix to be block-diagonal, with a 
block for each number N of asymptotic particles. In addition it follows that all scattering is 
elastic: the energy of each asymptotic particle is conserved in the process. 

Factorized scattering. The second remarkable consequence of the presence of higher sym¬ 
metries Qs in a two-dimensional field theory is that all blocks with N > 3 turn out to be 
determined by (^) two-body scattering processes. This phenomenon, known as factorized scat¬ 
tering, turns the computation of the full 5-matrix into a finite task, bringing exact results within 
reach. It can be understood as a result of the fact that higher symmetries act on particles by 
momentum-dependent translations; let us sketch the argument.^ 

It is convenient to parametrize momentum by the rapidity A via p^ = me^^, which takes 
into account the mass-shell condition = p'^p~ = m^. In a spacetime diagram (with time 
increasing upwards) the worldline of a free particle with momentum p is a straight line with 
slope A. Lorentz invariance implies that the 5-matrix entries only depend on rapidity differences. 
The two-body 5-matrix for iii 2 —)• / 1/2 corresponds to 

fl /2 

s/y“(Ai-A2) = V 

/Ai2\ 
ii *2 


Ai 2 := Ai — A 2 . (5.2) 


To see that this quantity completely determines the full 5-matrix of the theory let us consider 
three-body scattering with corresponding 5-matrix element 

fl fl /s 

\t/ 

‘5&5f(Ai-A2,A2-A3) = . (5.3) 

/l\ 

*1 *2 *3 

for the Coleman-Mandula theorem (see Witten’s [48, Lect. 4]): in three or more dimensions we can use 
a ‘forbidden’ symmetry to shift any two intersecting particle trajectories by momentum-dependent (and thus 
different) amounts to obtain two non-intersecting lines, so that there is no scattering. This argument fails in two 
dimensions since there two lines generically do intersect. 
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In a local field theory this scattering must happen in one of following ways, depending on the 
initial positions: 



*1 *2 *3 h h *3 h *2 h 

Now the higher symmetries at our disposal enable us to shift the worldlines by an amount 
that is momentum dependent and thus different for each of the lines. Thus we can pick suitable 
Qs’s to turn each situation from (5.4) into either of the others. The conclusion is that all 
three situations must represent the same physical process, so that (5.3) indeed factorizes into 

f f 

two-body processes. In addition we obtain a consistency condition for the entries of the 

two-body S'-matrix: 


fi /2 h fi /2 fs 




(5.5) 


This is the ybe in the context of factorized scattering. Further guaranteeing the consistency of 
the factorization of any Al-body scattering processes, the ybe is of central importance to the 
theory of factorized scattering and exact S'-matrices in two dimensions. For more about these 
topics we refer to the nice lecture notes by Dorey [31] and to [2, §1.1]. 

5.2 The Bethe/gauge correspondence 

Over the last two decades it is becoming apparent that gauge theories enjoying Af = 2 super- 
symmetry seem to come with an integrable structure for free. One class of such supersymmetric 
gauge theories was studied intensively by Seiberg and Witten [49] in the mid-1990s. It was soon 
realized [50] that at low energies these theories give rise to so-called classical algebraic integ¬ 
rable systems, closely related to (Liouville) integrable models from classical mechanics. These 
systems can be solved exactly, at least in principle, and such a low-energy classical integrable 
structure is a surprising and pleasant feature of the original gauge theories. 

An interesting question is whether this story also has a quantum-mechanical analogue: do 
there exist gauge theories which yield quantum-integrable models at low energies? In 2009, 
Nekrasov and Shatashvili showed that this question can be answered positively for a large class 
of supersymmetric gauge theories: this is the Bethe/gauge correspondence [35-38, 51]. 

This subsection provides a qualitative overview of the main idea presented in [35]. Rather 
than discussing the correspondence in full generality we describe what it entails for its main 
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example. Much more can be found in e.g. the following references. Nonperturbative QFT and 
SUSY are introduced in the book by Shifman [52]. A classic reference for supersymmetry is Wess 
and Bagger [53]. For two-dimensional M = (2, 2) gauge theories see Witten [54] and the book 
by Hori et al. [55]; see also the author’s MSc thesis [56, §3] and the references therein. Other, 
closely related, developments concerning exact results in AA = 2 gauge theories are reviewed in 
the recent series of papers by Teschner et al. [57]. The Bethe/gauge correspondence is introduced 
in [56, §4]. For a mathematical version of the Bethe/gauge correspondence see [58]. 

Rough version. We already know a lot about the ‘Bethe’ side of the correspondence. The 
‘gauge’ side of the story is about a very different area of theoretical physics, namely that 
of supersymmetric gauge theory. Although these arise naturally in string theory let us give 
another, more concrete, motivation coming from QCD. At high energies, the gauge coupling 
is very small, and the standard tools of perturbative quantum field theory are available: this 
is the asymptotically free regime. As we flow to the infrared, however, the coupling constant 
of the non-abelian gauge theory ceases to be small and perturbation theory breaks down. At 
the same time, the vacuum structure of a gauge theory determines the possible phases of that 
theory. Getting a grip on this low-energy regime is one of the great open problems in present-day 
theoretical physics. Out of the various approaches to try and overcome this problem we consider 
the following. Instead of studying QCD itself we shift our attention to its idealizations possessing 
supersymmetry (susy). This leads to a class of toy models that allow for more control whilst 
at the same time keeping several key features of QCD, providing an arena to test ideas about 
quantum field theory and non-abelian gauge theory in a controlled setting. Supersymmetry 
provides us with exact tools, so that a better insight can be gained into the structure of these 
idealized models. One may hope that some of this insight persists into the real world, where it 
may ultimately shed more light on QCD itself. 

Roughly speaking the Bethe/gauge correspondence amounts to the observation that 

There exists a class of SUSY gauge theories for which the (‘gauge-theoretic part’ of 
the) low-energy effective theory has the structure of a quantum-integrable model. 

Of course, this statement has to be supplemented with the specification which SUSY gauge 
theories this applies to. A more precise version of the statement is the following: 

For SUSY gauge theories with effective two-dimensional ‘AA = (2,2)’ super-Poincare 
invariance at low energies, the (‘Coulomb branch’ of the) SUSY vacuum structure 
corresponds to a quantum-integrable model. 

Let us illustrate these statements via the main example presented in [35], which features the 
XXX spin chain with Hamiltonian (2.7) on the ‘Bethe’ side. Before getting to the actual cor¬ 
respondence we take a look the ‘gauge’ side and describe the specific theories featuring in this 
example. 

Gauge-theory set-up. We begin with an ordinary non-abelian gauge theory with matter, 
rather like QCD, in 3 -|- 1 dimensions. The field content of the theory we want to study is as 
follows. The gauge group is G = U{Nc), so the gauge field describes Nc colours of gluons. 
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Next there are massive matter fields in the fundamental and antifundamental representations of 
the gauge group; these are the quarks and their antiparticles. Equally many of these fields are 
included, so that we have N{ flavours of quarks and antiquarks. This is similar to the situation in 
the Standard Model in the absence of the Yukawa couplings, where there is a flavour symmetry 
mixing the different generations of fermions. Finally we add one further massive field, living in 
the adjoint representation of U{Nc), these are like the VE-bosons in the Standard Model. 

The next step is to enhance the theory by making it supersymmetric. Although SUSY is a 
beautiful topic we suffice by saying that it is a boson-fermion symmetry, so each field now has a 
superpartner with the opposite statistics. The fields and their superpartners are nicely packaged 
together in representations of the SUSY algebra known as supermultiplets. For example, and 
its fermionic superpartner are contained in a ‘vector supermultiplet’, and each of the matter 
fields and their bosonic superpartners ‘chiral supermultiplets’, also in the (anti)fundamental or 
adjoint representation of the gauge group. 

So far the set-up is quite standard for SUSY gauge theories. Now we proceed towards the 
more specific situation that we need for our example; the number of spacetime dimensions has 
to be reduced to two. This can be arranged via a procedure known as dimensional reduction 
where we consider theories that are translationally invariant in two directions, so that we can 
restrict ourselves to physics in a (1 -|- l)-dimensional slice of spacetime, say the {t, zj-plane. At 
the level of fields this reduction is achieved by simply forgetting the dependence of the fields on 
the coordinates x and y. Although the result may seem pathological, non-abelian gauge theories 
in two dimensions still exhibit interesting phenomena such as asymptotic freedom, confinement, 
dimensional transmutation, and topological effects such as solitons. Thus, two-dimensional 
models provide a playground to learn about such aspects in an easier setting. 

The dimensional reduction has two consequences that are relevant for us here. Firstly, the 
original gauge theory the vector field has four components, describing two physical degrees 
of freedom corresponding to the transverse polarizations. As in Kaluza-Klein reduction, upon 
going down to two dimensions these components recombine into a two-dimensional gauge field 
with components Aq and Ai together with a complex scalar field a. Since there are no transverse 
directions in the reduced spacetime (there are no photons in two dimensions!), Aq and Ai do 
not correspond to physical degrees of freedom; therefore, the field a encodes the physics of the 
gauge field from the four-dimensional viewpoint. 

The second consequence of the reduction is that the amount of SUSY is enhanced, and we 
get M = 2 extended SUSY. This is a powerful tool for computations, bringing exact methods 
within our reach. In brief the reason for this enhancement is the following; SUSY operators are 
spinorial quantities, and the four real components that a Majorana spinor has in four dimensions 
recombine into two Majorana spinors worth of SUSY operators in two dimensions. In fact, in two 
dimensions, the Majorana (reality) and Weyl (chirality) conditions for spinors are compatible 
and can be simultaneously imposed, so that the minimal spinors in 1 -|- 1 dimension have only 
one component. In terms of the chirality of the SUSY operators we now have two left-handed 
and two right-handed such components; this is called ‘AA = (2, 2)’ SUSY in two dimensions. 

Goal: finding SUSY vacua. Even in lower dimensions and with extended SUSY, the full non- 
abelian gauge theory is still too complicated to solve. However, what can be computed exactly 
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is the effective theory in the infrared. When the energy is sufficiently low, the (massive) matter 
is effectively non dynamical, so the resulting effective theory is a pure gauge theory: it does no 
longer involve any matter fields. Our goal is to find the SUSY vacuum structure on the Coulomb 
branch, which is parametrized by (the vacuum expectation value of) the complex scalar held cr 
encoding the four-dimensional physical degrees of freedom of the gauge held. 

The low-energy theory is governed by a scalar potential whose zeroes are the supersymmetric 
vacua. Amongst others this requires a, which lives in the adjoint representation of U{Nc), to be 
diagonalizable. As a result the gauge group breaks down to its diagonal subgroup so 

that the low-energy effective theory constitutes W copies of electrodynamics; this is where the 
name ‘Coulomb branch’ comes from. Write am for the mth diagonal component of (the vacuum 
expectation value of) a, corresponding to the mth 17(l)-factor. The goal is to determine the 
values of the SUSY vacua am, also known as ‘Coulomb moduli’, through the vacuum equations 
for a = {ai,-■ ■ ,aN,)\ 


exp 



9Weff(q') 

dam 


= 1 , 


1 < m < W j 


(5.6) 


where Wes is known as the (shifted) effective twisted superpotential. Thus, the supersymmetric 
vacua on the Coulomb branch are similar to critical points of VFefj; the exponential is a peculiarity 
oi M = (2,2) theories in two dimensions. 


Approach: Wilsonian effective action. To find the effective theory in the infrared all 
matter fields have to be integrated out, as well as the higher modes of the gauge fields, to find 
Weff. This is where M = 2 SUSY comes in handy: by general arguments, it allows one to derive 
certain (‘non-renormalization’ and ‘decoupling’) theorems that highly restrict what can happen 
when fields are integrated out; in particular, Wes is one-loop exact. With the help of such 
theorems, the low-energy effective theory on the Coulomb branch can be computed exactly. 


Result: vacuum equations. When the dust has settled the vacuum equations for our theory 
turn out to be as follows: 


/ I ■ / O \ -^f I ■ 

/Um+l/ 2 \ _ -r-r (Tm — Cr^ + 1 

\am - i/2y am-CTn-i ’ 

n^m 


1 < m < Nc . 


(5.7) 


On the left-hand side each factor in the numerator comes from a one-loop diagram due a single 
flavour of quarks; likewise, the denominator is due to the anti-quarks. The product on the 
right-hand side is the contribution from the off-diagonal components of the gauge field. 


Guiding observation; dictionary. Recalling the discussion at the end of Section 2.3 it is 
clear that the vacuum equations (5.7) are precisely the same as the bae (2.42) for the M- 
particle sector of the xxx spin chain. This is the guiding observation behind the Bethe/gauge 
correspondence. Comparing (5.7) with (2.42) we arrive at the identifications listed in Table 3. 
Thus, the Bethe/gauge correspondence provides a dictionary which allows us to go back and 
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forth between the two sides of the correspondence. In this way, we may use our knowledge of 
one side to learn something about, or at least shed new light on, the other side. 


Bethe 

Gauge 

number of sites L 

number of flavours N{ 

number of magnons M 

number of colours 

rapidities Am 

supersymmetric vacua dm 

Yang-Yang function Y 

effective twisted superpotential Weff 


Table 3: Dictionary relating quantities for the xxx spin chain and the SUSY vacuum structure 
on the Coulomb branch for a Af = (2, 2) gauge theory in two dimensions. (The Yang-Yang 
function is defined in Appendix A; more precisely it matches with 27r Weff up to a shift.) 


More examples. Let us step back for a moment. So far we have looked at one particular 
SUSY gauge theory and found that its vacuum structure on the Coulomb branch corresponds to 
the M-particle sector of xxx spin chain. Is this just a coincidence? After all, one swallow does 
not make a summer. 

In the first two papers [35], Nekrasov and Shatashvili showed that the Bethe/gauge corres¬ 
pondence can accommodate for much more, and the dictionary can accordingly be enriched. 
The various integrable spin chains discussed at the end of Section 4.3 all fit in nicely: qua- 
siperiodic boundary conditions for the spin chain correspond to the inclusion of topological 
(‘Fayet-Iliopoulos’ and ‘vacuum-angle’) terms in the gauge theory; inhomogeneities and local 
spins straightforwardly correspond to different values of the (‘twisted’) mass parameters, im¬ 
plicit in the above, on the ‘gauge’ side; anisotropy is related to gauge theories in three or four 
dimensions, and more exotic types of ‘spin’ complement gauge groups other than U{Nc). 

Further examples of the correspondence, where the quantum-integrable models involve long- 
range (rather than just nearest-neighbour) interactions, were provided in [36]. Gauge theories 
with more involved Standard Model-like gauge groups fit in too [51], as do gauge theories on 
curved spaces [38]. The general pattern of the Bethe/gauge correspondence is the following: 

Consider a two-dimensional gauge theory, with 

• effectively, at low energies, two-dimensional A? = (2, 2) super-Poincare invari¬ 
ance 

• the appropriate matter content, and 

• suitable values for the parameters (implicit in the above), 

and determine the low-energy effective theory. Then the vacuum equations for the 

Coulomb branch coincide with the bae of a quantum-integrable model. 

Arguably this correspondence can be extended to encompass all quantum-integrable models 
that are solvable via a Bethe ansatz. 
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Achievements. Let us conclude with some words about the use of the Bethe/gauge corres¬ 
pondence. The ideas that we have described so far are of course nice and perhaps unexpected: 
they relate two seemingly very different areas of theoretical and mathematical physics. However 
it should be stressed that to the extent described so far this correspondence is not predictive. 
Indeed, in order to establish it one has to start with the appropriate models on both sides and 
calculate the bae and the vacuum equations in order to set up a dictionary between the two 
sides. Thus, a lot of information is needed as input, and one may wonder to what novel results 
it may lead. Here are a few examples of its early achievements: 

• In [36] the Bethe/gauge correspondence and related ideas were used to find so-called 
thermodynamic Bethe-ansatz (tba) equations for several long-range quantum-integrable 
models. These results later confirmed for the Toda chain with integrability techniques [59]. 

• Many new (quantum) integrable models have been obtained from ADE-quivei: gauge 
theories in four dimensions (in the H-background) [60]. 

• Dualities between supersymmetric gauge theories have been used obtain relations between 
different quantum-integrable models [61]; some of these are already known in the integ¬ 
rability literature but others appear to be novel. 

Thus, in short, the Bethe/gauge correspondence provides a way to translate knowledge about 
one side into statements about the other side, leading to new insights. 

A Completeness and the Yang-Yang function 

In Sections 2, 3 and 4 we have seen how, for a finite system with periodic boundary conditions, 
the coordinate and algebraic Bethe Ansatze lead to the bae. For the eigenvectors in T-Lm C Ti 
these consist of M coupled equations determining the allowed values of the parameters in the 
Bethe ansatz. An important question is whether the Bethe ansatz is complete: does it give all 
(^) independent eigenstates in 'Hm'I Note that the problem is more complicated than ‘just’ 
trying to count the number of solutions to the BAE: not all such solutions necessarily lead to 
physically acceptable states; in particular, the resulting Bethe vectors must be normalizable, 
i.e. have nonzero norm. The three possible situations are illustrated in Figure 10. 

The issue of completeness was already investigated for the xxx model by Bethe [21, §8], 
but it has remained a topic of debate to the present day, with approaches ranging from heavy 
numerics to combinatorics and algebraic geometry, see e.g. [62] and references therein. To 
understand why, notice that symmetries lead to degeneracies, while counting becomes harder 
by the presence of degeneracies in the spectrum. Indeed, since the Hamiltonian is hermitean, 
its eigenvectors are orthogonal for different eigenvalues. When there are no degeneracies in the 
spectrum, one can count the number of distinct eigenvalues to find the number of eigenvectors. 

From this point of view it is not surprising that the situation is better for the case with 
only partial isotropy. In this brief appendix we introduce an important tool for the study of 
completeness for the xxz model: the Yang-Yang function. 
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Figure 10: The three typical cases in the completeness problem of the Bethe ansatz. The 
left shows the mcomplete case, where the BAE do not have enough solutions to obtain the full 
spectrum of the Hamiltonian. The right, instead, corresponds to the orercomplete case, where 
all eigenvectors are of the Bethe form but the BAE have more than 2^ solutions and it has to be 
determined which of those are physically acceptable. The middle depicts the intermediate case. 


xxz case. Consider the Bethe ansatz parametrized in terms of rapidities A G like at the 
end of Section 2.3; see also (4.65). In logarithmic form the bae for the M-particle sector read 

M 

LpjXm) = Q(An, A^) , 1 < m < M , (A.l) 

n=l 

n^m 

where I = (Ji, • • • , Im) are the Bethe quantum numbers, cf. (2.34), and the two-body scattering 
phase 0 depends on the rapidity difference and the anisotropy A = cosy, cf. Exercise 2.13 in 
Section 2.3. 

It may come as a surprise that the bae admit a ‘potential’; there exists a function 
Y: C such that (A.l) is equivalent to the extremality conditions 

= 0 , 1 < m < M . (A.2) 

uXm 

This Yang- Yang function or Yang- Yang action can be defined as 

y(A) :=lJ 2 Pi^m) - 27r ^ ^ - A^) , (A.3) 

m=l m=l m,n=l 


where 


p{X)= ipi/2{p)dp,, 0(A) = / (flip) dp, 


... 1 , sinh(A -b iys) 

(/j,(A) := vlog . . 

1 smh(A — 17 s) 


(A.4) 


Exercise A.l. Check that the solutions of the bae (A.l) are precisely the critical points of (A.3). 
Yang and Yang employed Y (A) to show that the bae admit a class of real solutions [20] . 

Theorem 3. When 0 < A < 1 the bae (A.l) have a unique real solution A G for any I. 
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Proof. The uniqueness is proven with a convexity argument, see also [ 8 , §11.1], which only works 
when A is real. Using 


v^UA) = t:7: 


—isin( 7 s) 


2 sin(7s) 


i sinh(A + i 7 s) sinh(A — i 7 s) cosh(2A) — cos( 27 s) 
it is easy to see that the Hessian matrix dmdnY is negative when 0 < 7 < 7 r/ 2 : then 


(A.5) 


ym ^ 


m,n=l 


dXmdXr, 


^ ^ ^ P 1 I 2 (Am) Vrn T n ^ ^ P\{Xm Xn) {Vm Vn) <0 (A.6) 


m=l 


m,n=l 


for any A, d G This implies that Y has at most one critical point, which, if it exists, is 

a global maximum. The existence of this critical point, however, requires some care: one has 
to show that the critical point does not run away to infinity, as happens e.g. for the convex 
function y{X) = — e^ on M. The proof can be found in [20, §4]. □ 

Now by construction the Bethe ansatz is symmetric in the Am, so solutions to the bae that 
only differ by a permutation (relabelling) of the Am correspond to the same Bethe vector; such 
solutions should only be counted once. Since the bae (A.l) are symmetric under simultaneous 
interchange of Am Xn and Im In-, this amounts to taking into account only Bethe quantum 
numbers / with 0 < /i < • • • < Im < L — 1. Moreover, by the Pauli exclusion principle, Bethe 
vectors vanish whenever two rapidities coincide, so in fact it suffices to consider 0 < /i < • • • < 
Im < L — 1. There are precisely (^) such choices for 7, and by the theorem there is a unique 
real solution A G for each of these. Since the corresponding energies (2.44) are different, 
this goes a long way towards a proof of the completeness for the regime 0 < A < 1 of the xxz 
spin chain. 

A few years later Yang and Yang applied the same technique to analyze the (simpler) one¬ 
dimensional Bose gas [63]; see also [ 8 , §1.2]. It is worth mentioning that Y(A) also features 
in Gaudin’s hypothesis, which says that the (square) norm of the Bethe wave function can be 
computed as the determinant of the Hessian matrix dmdnY, see e.g. [ 8 , §X]. 


XXX case. In the isotropic case the functions (ps in the Yang-Yang function have a simple 
primitive (cf. Exercise 2.20 at the end of Section 2.3), and the integrals in (A.4) can be performed 
explicitly: 


1 

/ ¥?s1a=i(^) d;U = Y [(A-I-is) log(A-I-is) - (A - is) log(A - is)] -b const , (A.7) 

where the constant on the right-hand side does not affect the bae (A. 2). However, the Yang- 
Yang function is no longer convex for A = 1, and Theorem 3 does not apply. However, when 
the degeneracies of the xxx spin chain are lifted by turning on ‘generic’ quasiperiodic boundary 
conditions or inhomogeneities (see the end of Section 4.3), completeness of the Bethe ansatz 
can be proven [64]. 
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B Computations for the M-particle sector 

In this appendix the CBA from Section 2.2 is worked out for the M-particle sector of the xxz 
spin chain to derive the results quoted in Section 2.3. Recall that the CBA 

'I/p(Z) = ^ A^{p) e'P^-' , h<---<lM (B.l) 

for the wave functions in the M-particle sector, 

\'^M',p) = ^ ^p{h,-" jM)\h,-" Jm) ^'Hm , (B.2) 

yields eigenstates of the Hamiltonian if the equations 

{1\H^xz\^m',p) = Em{p) ^p{l) , 1 < Zi < • • • < Zm < ■ (B.3) 

can be solved. Our goal is to find the energies Em{p)-, the coefficients and equations 

determining the values of the parameters p. Let us write G N for the number of pairs of 
neighbouring excitations in the configuration I of excited spins, i.e. Ni ;= #{Zn | ^n+i = ln + 1}, 
so 0 < < M — 1 (unless M = L = Ni). For example, Figure 1 from Section 2.1 has Ni = 1. 

In terms of this notation the strategy (cf. Section 2.2) is as follows: 

0. Compute the left-hand side of (B.3). 

1 . Solve (B.3) for the energy contribution sm{p) ■= Em{p) — Eq as a function of p by 
considering configurations I with well-separated excitations {Ni = 0). 

2. Solve (B.3) for the A.,^{p)/A(,{p) as functions of p by considering I with A^j > 1 . (Luckily 
it will turn out that it suffices to consider a single pair of neighbouring excitations.) 

3. Impose periodic boundary conditions to get the bae for the allowed values of p. 

As in Section 2.3 we put h = J = 1. At the end of this appendix we comment on the six-vertex 
case and give the strategy to derive the results given in Section 3.3. 

Warm-up: M = 2. It is instructive to work out the strategy for the two-particle sector before 
tackling the general case. 

Step 0. Expand the vector \'i> 2 ]p) via the coordinate basis of 'H 2 as in (B.2): 

\^AP) = ^ 'I'p(Zi, Z2) |Zi, Z2) £ LZ2 • (B.4) 

We compute iLxxz|Zi,Z 2 ) using (2.4) and (2.9). The first two terms in (2.9) give 

5'^5'^+i|Zi, h) = |Zi ± 1, 12) + |Zi, ^2 T 1)) where one of the two vectors on the right-hand side 

vanishes when the excitations are next to each other. The third term, multiplies 

|Zi, h) by (L/4 — 2) if li and I2 are well separated, and by (L/4 — 1) if they are neighbours. 
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Step 1. In the well-separated case, Ni = 0, (B.3) gives 

2e2{p) ^pih, h) = 4 A ^ 2 ) — ^p{h — 1, h) — 'hp(^i + 1, h) 

These difference equations have to be satisfied by the wave function for all pairs li < I 2 — 1- 
Plugging in the CBA 

^PuP2{lul2) = Ae{pi,P2) +A^{p,,p^) e4pd2+P2h) ^ ^ (b.6) 

immediately yields the result 

e 2 (pi,P 2 ) = 2 A - cospi - C 0 SP 2 = ei(pi) + £ 1 (^ 2 ) • (B.7) 

2 S2{p) ^p{l-,l + 1) = 2A'hp(/,/ -|- 1) — ^p{l — 1, / -|- 1) ~ 'hp(^) / + 2) . (B-8) 

These equations can be solved using a trick exploiting the similarity between (B.8) and (B.5). 

Exercise B.l. Check that (B.5) is satisfied by Tp from (B.6) and £ 2 {p) from (2.31) independently 
of the values of h and l 2 - 

In particular, given (B.6) and (B.7), (B.5) holds true even when Ni = 1. When ^2 = + 1) 

however, the right-hand side of (B.5) features wave functions with equal arguments, which are 
not defined. Although such Tp(/,1) are not physical — they do not enter (B.4) — the trick is 
to extend the Bethe wave function to the diagonal h = I 2 using the formula (B.6). In this way 
nothing changes for Ni = 0, while (B.5) with Z 2 = + 1 gives extra information that we can 

use to solve (B.8). 

To see if the so-extended CBA does indeed satisfy (B.8) we subtract (B.5) with ^2 = + 1 

from (B.8) to get 

2ATp(/,/-|-l) = Tp(/,/)-|-Tp(^-|-l,/-|-l). (B.9) 

Up to an overall normalization the coefficients in the CBA (B.6) are determined by these equa¬ 
tions: (B.9) is satisfied if the two-body 5-matrix (2.27) is the 1x1 matrix given by the res¬ 
ult (2.32) from Section 2.3. 

Step 3. We already obtained the bae for M = 2 in Section 2.2, see (2.28). 

General M. We carry out the strategy following 1[1, §8.4]. The reader is advised to compare 
each step with the corresponding step for the case M = 2. 

Step 0. The equations (B.3) are computed for the M-particle sector as for M = 2. The 
result can be compactly written as 

2eM(p)^p(0 = 2(M-iVi)ATp(0- , (BTO) 

k 

where the prime indicates that the sum runs over the 2 (M — Ni) configurations k obtained 
from I by letting any single excited spin hop to an unexcited neighbour. 

Exercise B.2. Check that (B.IO) contains both (B.5) and (B.8) for M = 2. 
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(B.ll) 


Step 1. For = 0 we have to solve 

2 SMip) ^p{l) = 2MA5'p(0 - ^p{k) , 

k 

where the sum runs over 2M configurations. Plugging in the CBA (B.l) it is easy to see that 
(B.ll) is satisfied provided the energy contribution is given by the result (2.36) quoted in 
Section 2.3. 

Step 2. Here the real work begins. Repeating the trick of extending the CBA to h < h ^ 
■ ■ ■ < Im the equations for Aj = 0 can again be used to simplify those for Ni > 1. Indeed, 
the extended ^p{l) satisfies (B.ll), still involving a sum over 2M configurations, also for all 
(nonphysical) I with at least one pair Im = Im+i- Subtracting that equation from (B.IO) we 
obtain 

2Ni A^p{l)=jY'^p{k) , (B.12) 

k 

where the sum now runs over the 2Ni configurations obtained from I by moving one excitation 
in any single pair of neighbours on top of the other one. For example, when Ni = 1, say with 
In+i = In + 1) (B-12) boils down to a simple generalization of (B.9): 


2 A , • • • , Zn,, fn, -|- 1, • • •, Im ) — ^p(^l : ' ' ' ilni^ni ' ' ' 1 ) 

+ ^p{h, • • • ; + 1 ) + 1 ; • ■ ■ ) ^m) ■ 


(B.13) 


Usually (B.12) contains many more equations than there are unknowns (the functions 
and the values of p), so our task seems daunting. Luckily the equations that we have to solve 
are simplified by the following observation, exploiting the similarity between (B.12) and (B.13). 
Suppose that we would be able to solve (B.13) not just for Ni = 1, but for any Ni > 1. 
Because (B.12) can be recognized as the sum of Ni copies of (B.13), one for each pair of 
neighbouring excitations, then all other equations in (B.12) would automatically be satisfied as 
well! Although this observation does not change the number of equations that we have to solve, 
the new equations all have the same form, which moreover is very similar to that of (B.9). 

Thus we focus on (B.13) for some 1 < n < M — 1, where I may or may not contain additional 
pair of neighbours. (The special case n = M, for which Im = L is next to h = 1, corresponds 
to periodic boundary conditions. This is tackled in step 3 below.) 

Exercise B.3. Let us abbreviate k := (/i, • • •, • • •, Im)- Plug the CBA (B.l) into (B.13) to 

find 


E KP.W,P.(n+i))^.(p)e‘^--'' = 0, s{p,p'):=l-2Ae^P'+e^(P+P'^ . (B.14) 

tt&Sm 


Not all TT G Sm, are independent. Indeed, since kn = kn+i these exponentials only 

contain the quasimomenta and PTr{n+i) iri the combination p.,^(^n) + P-n{n+i)- Writing := 
(n, re + 1) G Sm for the transposition switching re GA re + 1, this means that when 

TT = tt' o Tn- Thus the terms in (B.14) come in pairs, and we find 


^TTTnip) _ 'S(P7r(n))P7r(n+l)) _ . 

A,iP} sfc(„+i).P.(„)) 


(B.15) 
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This is the M-particle generalization of equation (2.32) for the two-body S-matrix. In summary, 
for each l<n<M — Iwe obtain M!/2 equations (B.15). 

Any permutation vr G Sm admits a (non-unique) decomposition as a product of transpos¬ 
itions interchanging neighbouring pm, so repeated application of (B.15) allows us to express 
A^(p) in terms of products of two-body S-matrices and an overall normalization Ae{p), with 
e G Sm denoting the identity. The non-uniqueness of this decomposition leads to new compat¬ 
ibility conditions, which are always satisfied since the two-body A-matrix is a scalar quantity. 

Exercise B.4- Check that vr = (1, 3) G Sm can be decomposed as vr = ri o r 2 o n = T 2 o ri o t 2 . 
Carefully apply (B.15) to find A(]^ 3 )(p). Compare the result with Figure 2 from Section 3.3. 
Up to an overall normalization, the (unique) general solution of (B.15) is 


^Ap) 

Ae{p) 


sgn(7r) 

l<m<m'<M 


) P'K(m) ) 
siPm' 1 Pm ) 


S{PmiPm') ) 

(m,m')sinv( 7 r) 


(B.16) 


where inv(7r) := {1 < m < m' < M \ 7r(m) > TT{m')} is the set of inversions for vr. Thus we 
have derived the result (2.37). 


Exercise B.5. As a warm-up take n = Tn and check that (B.16) solves (B.15). For the general 
case verify the solution by splitting the product into four parts, depending on whether or not 
m and m' lie in {n,n -|- 1}. Use sgn(7r) = (—to verify the second equality in (B.16). 


Step 3. It remains to impose periodic boundary conditions on the Bethe wave function. 
Indeed, when working with the coordinate basis (2.15) for Bm a subtlety arises because of 
periodicity. To avoid linear dependence amongst the vectors in (B.2) we have ordered the 
positions of the excited spins as h < I 2 <■■■ < Im- However, the circle does not possess 
an ordering. In the above we have implicitly chosen representatives in {1,2,---,L} C Z for 
the sites Zm £ (in a more pictorial language: we have cut open the circle between sites L 
and 1). Of course we could have chosen to cut 'Ll at any other point; for example, a cut just 
after li corresponds to choosing representatives {Zi -|-1, Zi -|-2, • • •, -|- L} and yields the ordering 

h < h < ■ ■ ■ < Im < h + L. Independence of the choice of representatives for Ll thus requires 
1^2) • • • j Im, h+L) = \li, • • •, Im)- This just expresses the periodic boundary conditions Si+l = Si 
in terms of the coordinate basis. 

The upshot is that any vector Am) £ Bm may be expressed in terms of the co¬ 
ordinate basis (2.15) as in (B.2) provided periodicity is imposed on the wave function as 
^'(^M - ,Im-i) = ■ ■ ,1m) or, equivalently. 


■ ■ ,lM,h + L) = ■ ■ ,1m) , h < ■ ■ ■ < Im ■ (H-17) 


Write a = (12 • • • M) G Sm for the cyclic permutation. For the Bethe wave function (B.l) 
we get 'I'p(/ 2 , • • •, Im, h + L) = At^>{p) , where tt' := tt o Dropping these 

primes and equating coefficients in (B.17) we obtain Ml bae: 

^ ^ vr G 5 m . (B.18) 

\P) 
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Exercise B.6. Plug in the solution (B.16) for Check that factors with 2 < m' < m < L in 
the numerator cancel those with 1< <m<L — 1 in the denominator. Note that the result 
is symmetric in the PTr{ 2 )i''' so that the equations are the same for any two vr, tt' G Sm 

with the same value vr(l) = Thus it suffices to consider transpositions of the form 

TT = (l,n) G Sm- Check that this yields the bae (2.38) for the M-particle sector: 


gipmi _ 


M 


-!)"-■ n 


n=l 


s{Pn,Pm) 

s{Pm-,Pn) 


M 

S{Pn-iPm) ) 

71=1 


l<m<M . (B.19) 


Thus we have derived all results quoted in Section 2.3. 

Exercise B.7. Work out the CBA for the xxz in an external magnetic field, see Exercise 2.5 in 
Section 2.1, starting with the cases M = 0,1, 2 before attacking the general case. 


Six-vertex case. To conclude this appendix we briefly turn to the strategy to work out the 
CBA for the six-vertex model. Thus, the goal now is to use 

M 

K{.z) zj , zj := Yl , h<-- - <Im ■ (B.20) 

ttGSm m=l 

This produces eigenvectors for the transfer matrix provided we can solve the equations 

Y{l\t\k)l>^{k) = {l\t\lfM;z) = AMiz)l>z{l) , <Im <L . (B.21) 

k 


The left-hand side of (B.21) is much more complicated than its xxz-analogue in (B.IO). This 
requires more care in formulating the strategy for using the CBA in this case: 


0. Rewrite the left-hand side of (B.21) by summing geometric series in the Zm, 


N 




zN+l 


1-z 1-z' 


(B.22) 


1. Focus on the wanted terms, involving 2 ^* like the CBA, to find Am{z). 

2. Consider unwanted internal terms, containing {znZn+iY"^ or {znZn+iY"'^''^ ■ Demand that 
these cancel to get (B.14), with pm = —ilog(^;m) instead of pm, and proceed as for the 
xxz spin chain to find At^(z). 

3. Demand that the unwanted boundary terms, obtained in (B.22) when n = 1 or = L, 
cancel to get (B.18) in terms of the Zm- The bae for the allowed values of 2 are then 
found as above. 


The details can be found in [1, §8.3-8.4] (cf. the footnote at the end of Section 3.2). 

Exercise B.8. Compare this strategy with that for the xxz model keeping in mind Exercise 3.6 
in Section 3.2. 
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C Solving the fcr 


Theorem 2 from Section 4.2 tells us that if we can find an i?-matrix satisfying the fundamental 
commutation relations (fcr) then the corresponding transfer matrices commute, resulting in 
conserved quantities for the xxz and six-vertex models, cf. Sections 3.3 and 4.1. Consider two 
monodromy matrices depending on different sets of vertex weights, such that the corresponding 
Lax operators are given by 


Lai = “ 


(a \ 

b c 
c b 

\ a/ 


Lh 


al 


(a' 

b' 

c' 

V 


c 

b' 


(C.l) 


a'j 


with respect to the standard bases of Va®Vi and 14 (8) Vj, as in Section 4.1. The entries are 
vertex weights of two different six-vertex models. 

In this appendix we follow Baxter [1, §9.6-9.7] to obtain an i?-matrix solving the FCR 


Lab Lai Lhl — L^l Lai Lab j 


(C.2) 


or in graphical notation 


a 

b 




I 


I 


(C.3) 


As a byproduct we will find conditions on vertex weights w and w' necessary to get com¬ 
mutating transfer matrices: from (C.3) we will rederive the condition A(a, 6, c) = A(a',6',c') 
from Section 3.3 in the more algebraic setting from Section 4. 


Reduction by symmetries. Since 14 ® 14 ® Vz has dimension eight, (C.2) a priori consists 
of 8 X 8 = 64 equations: 


7 7 



It is reasonable to look for solutions Rab £ End(14(8)14) that preserve the two symmetries of the 
six-vertex model: the ice rule (line conservation) and spin-reversal symmetry. Thus we assume 
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that the i?-matrix is of the same form as the Lax operators (C.l) 


^ab — 



(C.5) 


where the three entries have to be determined. Let us emphasize once more that, both for 
the Lax operator and for the i?-matrix, the order of the ‘outgoing’ labels is reversed in the 
coefficients, see (4.3) and (4.23). 

Due to parity reversal these equations come in 32 equal pairs. Line conservation requires the 
occupancy number to be preserved; « + /? + /?' = ') + 8 + 5'. This further reduces the number of 
nontrivial equations to 10. Using parity reversal we may restrict our attention to the ten cases 
with at least two incoming occupancies. Simultaneously switching a -f-)- 7 , /3 -H- 5 and j3' -H- 8' 
in (C.4) yields 

a a 


8 

5 ' 



(C. 6 ) 


7 


7 


When we rotate this over 180° we precisely recover (C.4). But the vertex weights are invariant 
under such a rotation (see Figure 6 in Section 3.1), so it follows that the four equations with 
a = 7 , (3 = 8 and (3' = 8' are automatically satisfied while the six remaining equations come in 
equal pairs. Thus we are left with just three independent equations. Paying attention to which 
crossing corresponds to and Rab these equations read 


Rf ff \ f iJt 

ao c + cc 0 = 



6 a'c" , (C.7) 


ac' 6 " + c 6 'c" = 



6 c'a" , (C. 8 ) 


/ I iJvJf 
ac c CO 0 = 



f // 

ca a , 


(C.9) 
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Solving the FOR. Let us first show how (C.7)-(C.9) result in constraints on Lai and L^i that 
are necessary for the existence of an i2-matrix satisfying the FOR (and thus yield commuting 
transfer matrices). Eliminating the (doubly primed) entries of the i?-matrix we recover the 
constraint A(a, 6, c) = A(a',b',c') from (3.21), where A was defined in (3.18). Recall from the 
discussion following (3.21) that, for fixed value of the function A, the vertex weights of the 
six-vertex model are parametrized (up to an overall normalization) by the spectral parameter. 
Thus we are once more led to Lax operators depending on spectral parameters as in Theorem 2. 
In Section 3.3 we found the condition A{a,b,c) = A{a' , d) for commuting transfer matrices 
through the results of the CBA; presently we obtained it by purely algebraic methods! 

Now we turn to the R-matrix itself. Notice that (C.7) and (C.8) only differ by the position 
of the single and double primes, while (C.9) is symmetric in this respect. It follows that if we 
instead eliminate the (primed) entries of Lf^i from (C.7)-(C.9) we similarly obtain the further 
condition A(a, b, c) = A(a", 6", c”). Thus, again up to an normalization (which drops out of the 
FOR at any rate), the entries of the R-matrix differ from those of the two Lax operators only by 
the value of the spectral parameter w. Using the explicit parametrization (4.10) we find that 
(C.7)-(C.9) are satisfied provided that the spectral parameters are related by the difference 
property sinh(t(;) = sinh(rt — v), which holds for w = u — v. Explicitly we thus obtain 

a" = p" sinh(u — u -|- iy) , b" = p" sinh(u — v) , c" = p" sinh(i 7 ) , (C.IO) 

where cosy = A(a, b, c) = A(a', 5', c'), for the entries of the i?-matrix of the six-vertex model. 
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